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Erythropoietin Muteins With Enhanced Activity 

This invention was made with Government Support under grant nos. 
R01-HL42949 and R01-GM39900 sponsored by the National Institute of 
Health. The Government has certain rights in the invention. 

S Cross-Reference to Related Applications 

This application is a continuation-in-part of U.S. Patent Application 
Serial No. 08/049,802 filed April 21, 1993 which is incorporated herein by 
reference. 

Background 

10 This invention relates to eiythiopoietin in general and more particulariy 

to modified eiythnqx>ietin proteins (erythropoietin muteins) having improved 
biological activity. 

Erythropoietin (hereafter referred to as Epo) is a glycopiotein hormone 
^ which, in its mature form in humans, is 166 amino acids long widi a 
IS molecular weight of 34 to 38 kD (Jacobs et al., Natm 313: 806-809 (198S). 
Epo -occurs in a, fi, and aaalo fonns wludi differ slighdy in didr 
caibcAydrate oompodtira and bidogical activity (Doidal etal., Bubcrinology 
116: 2293-2299 (198S)). 

In tenns of bicdogical function, Bpo has been leopgnized as die 
20 hematopoietic cytddne diat legulates die process of red blood cell production, 
known as eryOiropoieas. Eiydiropoiess is a controlled physiological process 
which normally produces red blood ceils in numbers which do not impede 
blood flow, but whidi are sufficient for oxygen transport 

The binding of Epo to its cognate receptor (D*Andrea, A.D„ et ai, 
25 Cell 57:277-285 (1989)) on ciydiroid precursor cells in die bone marrow 
results in salvaging diese cells from programmed cell deadi, known as 
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apoptosis (Koury, MJ., et ai, Science 248:378-381 (1990)), allowing them 
to proliferate and differentiate into circulating erythrocytes (red blood cells). 
Epo thus facilitates the forniation of erythrocytes from erythroid precursor 
cells. 

5 The level of Epo in circulation normally controls the rate of red blood 

cell production in the body. Typically, Epo is present at a low plasma 
concentration sufficient to maintain a steady state concentration of red blood- 
cells by stimulating formation of just enough red blood cells to replace those 
lost through normal processes, such as aging. However, when an increase in 

10 the production of red blood cells is needed, the amount of Epo in circulation 
is increased. This red blood cell deficit may be caused, for example, by 
anemia, the general loss of blood through hemorrhage, over-exposure to 
radiation, prolonged unconsciousness, or exposure to high altitudes where 
oxygen intake is reduced. 

IS In miammals, Epo is produced in the fetal liver and adult kidney, 

circulates in the bloodstream, and binds to receptors on committed progenitor 
cells in the bone marrow and other hematopoietic tissues, resulting in 
proliferation and terminal maturation of eiydiroid cells (Jelkmann, W., 
Physiol. Rev. 72:449 (1992)). The expression of Epo^ botii mRNA and 

20 protein, is maricedly increased by hypoxia, owing tt> a 3' enhancer and to 
highly oonsoved dements in tiie promoter region ^emenza, G.L eial., 
Prvc. Natl. Acad. Sd. VSA AS:S680-1684 (1988); Beck. I. etal., J. Kol. 
Oim. 266:15563-15566 (1991); Pugh, C.W. etd., Proc. Nad. Acad. Sd. 
USA 8S:1QS5MQS57: Blanchaid, K.L. et al.. Ma. CettBiol. /2:5373>538S 

25 (1992)). Tins el^am servomedianism enables Epo to regulate die red odl 
mass of man and otiier animals. 

Genes encoding Epo from a number of species have been studied. 
Thus br, the Epo genes of man (Jacdbs, K. esr al.^ Nature 575:806-10 (1985); 
Un, F.-K. etal., Proc. Natl. Acad. Sd. USA «2:7580-7584 (1985)), a 

30 monkey (Macacafasdcularis) (Un, F.-K. ei al. , Gene 4^:201-209 (1986)) and 
a rodent, die mouse (McDonald, J.D. a al., Mol. & Cell Bol. 6:842-848 
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(1986); Shoemaker, C.B, et aL, MoL A Cell Biol. 6:849-858 (1986)) have 
been cloned, sequenced, and expressed. There is a high degree of sequence 
homology in the coding region of the mature secreted proteins from these 
species. 

S In particular, the gene encoding human Epo has been cloned and used 

to produce recombinant Epo in cell culture, Recombinantly produced human 
Epo has genera] utility as a substitute for Epo isolated from natural sources. 
In addition, recombinant Epo has facilitated the production of large amounts 
of Epo free from deleterious substances associated with naturally derived Epo, 

10 which has allowed for greater availability and utility of this protein. 

Recombinantly produced Epo has proven especially useful for the 
treatment of patients suffering from impaired red blood cell production 
(Phyacians Desk Reference (PDR), 1993 edition, pp 602-6QS). Recombinant 
Epo has proven effective in treating anemia associated with chronic renal 

IS failure and HIV-infected individiuds suffering from lowered endogenous Epo 
levels related to ihtTzpy widi 2dovudine (AZD (See FDR, 1993 edition, at 
page6Q2). 

Modifications of die Epo protein whidi would inq>rove its utility as a 
tool for diagnosis or treatment of blood disorders are certainly desirable. In 

20 particular, modified forms of Epo exlubitii^ eidianoed biological activity 
woul^ be more effective and effidem dian native Epo in die dieiapy settiqg 
when it is necessary to administer Epo to die patient, enai)liqg administration 
less fiequendy dnd/or at a lower dose. Admimstiation of reduced amounts of 
Epo would also presumably reduce die risk of adverse effects assodated widi 

25 1^ treatment, audi as hypertoision, seizures, beadadies, etc. (See PDR, 
1993 edition, at pp. 603-604). 

Unfortunately, av^lable information regarding die structure of die Epo 
enzyme and its function does not permit die accurate prediction of 
modifications which would result in such improvements. In fact, die highly 

30 conserved nature of die amino acid sequence of Epo indicates diat diis protein 
is, for die most part, resistant to change and diat most modifications would be 
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expected to have deleterious effects on Epo and its activity (See U.S. Patent 
No. 4,835,260 by Shoemaker), 

Some speculation does, however, exist regarding the possibility of 
creating variants of Epo with altered characteristics that would be preferable 
5 to the wild type protein for particular purposes. A number of modifications 
of Epo have been proposed, including deletions, additions, and substitution of 
existing amino acids. These modifications have been prqwsed as a means of 
achieving a variety of effects, including separation of various Epo functions 
on individual protein fragments, improving the efficacy of recombinant Epo 

10 production, improving in vivo stability, etc. Of particular relevance to the 
present invention are those modifications proposed to improve the biological 
activity (i.e. ability to stimulate red blood cell production) of Epo. 

U.S. Patent No. 4,703,008 by Lin, F-K. (hereinafter referred to as 
"the '008 patent") speculates about a wide variety of modifications of Epo, 

IS including addition, deletion, and substitution analogs of Epo. The '008 patent 
does not indicate that any of the suggested modifications would increase 
biological activity per se, although it is stated that deletion of glycosylation 
sites might increase the activity of Epo produced in yeast (See the '008 patent 
. . at column 37, lines 25-28). Also, the *008 patent speculates that Bpo analogs 

20 which have one or more tyrosine residues replaced with pbenylalaiune may 
exh^iit an increased or decreased receptor bindiqg affinity. 

Australian Patent A|q)lication No. AU-A-S9145/90 by Fibl, M ei d. 
(herein refmed to as *Fibi") also discusses a numb^ of modified Epo piotdns 
(^muteins). Hln generally speculates about die alteiadon of amino adds 

25 10-55, 70-85, and 130-166 of 1^. In pardcular, additions of positively 
chained basic amino acids in the caiboxyl terminal region are purported to 
increase the biological activity of 1^. 

U.S. Patent No. 4,835,260 by Shoemaker, C. B. discusses modified 
Bpo proteins witii amino acid substinitions of die methionine at position 54 and 

30 asparagine at position 38. Such Epo muteins are tiumght to have improved 
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Stability but are not proposed to exhibit any increase in biological activity 

relative to wild type Epo. 

Beyond the few proposed modifications that have been speculated to 

enhance the biological activity of Epo, the art provides no further guidance 
5 regarding how Epo can be modified to increase its biological activity. This 

lack of guidance indicates that, based upon the present state of knowledge, 

other modifications of Epo which may increase biological activity cannot be 

predicted and may not even exist. 

The present invention represents the inventors' achievement in 
10 overcoming the lack of guidance and unpredictability regarding modifications 

of the Epo protein which increase its biological activity. 

Summary Of The Invention 

It is tbdrefore an object of the present invention to provide Epo mutdns 
with enhanced biological actiidty. 
15 It is a further object of tiie present invention to provide recombinant 

DNA encoding Epo muteins and methods of producing biologically active Epo 
- muteins in a host cell culture usii% such recombinant DNA. 

it is a fuitiier object of die present invention to pnmde nediods of 
treating blood discmters invol^g aimomially low red blood oell pq>ulations 
20 ustqgl^ muteins at dosages lower tiiantimtreqiured for unnuxfified ^ to 
achieve die same affect 

It is a fiirdier object of die present invention to proidtfe a mediod of 
obtuiuqg additional Ejpo mutdns wiih enhanced biological acti^. 
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Brief Description Of The Drawings 

Figure 1. Oligonucleotide Primer Sequences. Oligonucleotide 
primers corresponding to Epo DNA sequences are shown. The localization 
of the primer coincides with previously published nucleotide sequences for the 
5 human (Lin, F,-K. et ai, Proc, Nail. Acad. Sci. USA 52:7580-7584 (1985)) 
and murine (McDonald, J.D. ei al.MoL & Cell Biol. 6:842-848 (1986)) Epo 
genes. EX2R and EX5 are not completely conserved between human and 
mouse (respectively 93% and 95% identity). An equal amount of each 
possible nucleotide was incorporated during the corre^nding cycles of those 

10 primer syntheses. EX2R and NCOl are reverse primers and their sequences 
represent the antisense DNA strand. 

F^ure 2. PCR Strategy used for the doning of mammalian 
cDNAs containing the compfete coding sequence of the mature 
Eiythropoietui IVotdn. (knomic amplification and sequendqg of exonic 

IS fragments, localized upstream and downstream from the nucleotide portion 
coding for the mature protein, allowed the design of species sped&c primers 
(SP). Utilization of those SP primers and/or of, sequences that are 100% 
conserved between man and mouse S' ATG and 3' NCOl primers (Figure 1) 
on cDNA tenqilates prepared fiom kidney of uiunduced or hypoxta*induoed 

20 animals, allows the anq>lificatiQn of a laige variety of mammalian Bfo clones. 
Set^ H and antisense («-) primers are rq)resented by the arrows. Dashed 
boxes correspond to die coding part propeptide and mature prot»n) of tiie 
five^exons. Gray boxes rq)resent the S' and 3' untranslatBd regions. 
Figure 3, Comparison of IV1/EX2R sequences from various 

25 mammals. The numbering corresponds to die published human genomic 
sequence (Lin, F.-K. etal., Proc. Natl Acad. Sd. USA 82:7580-7584 
(1985)). The reported human sequence was obtained from die amplification 
of purified genomic DNA from Hep3B cells and agreed with die sequence 
preWously reported. The mouse sequence is from McDonald, J.D. et al.^ 

30 Md. & CeUBoL 6:842-848 (1986). The horse sequence was obtained from 
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kidney-extracted genomic DNA. All the other sequences were established 
from PCR amplification of genomic DNAs purified from several mammalian 
renal-derived cell lines. The consensus sequence indicates the positions where 
a unique nucleotide was found in all the reported species. The boundary 
5 between iniron 1 and exon 2 is represented by the ascending arrows. The 
locations of the two PCR primers, IVl and EX2R are shown by the dashed 
arrows. 

Figure 4. Alignment of the nucleotide sequences of mammalian 
Epo cDNAs. The mouse sequence corresponds to the one previously reported 

10 (McDonald, J.D. et al. , MoL & Cell BioL 6:842-848 (1986)). PCR-produced 
human sequence was obtained from hypoxia-induced Hep3B cell line and is in 
total agreement with the previously published sequence (Jacobs, K. etcL, 
Nature 5Ji:806-810 (1985); Lin. F.-K. e/u/., Proc, Natl Acad. Sci. USA 
82:7580-7584 (1985)). Monkey (Macaca mulatta), rat, sheep, pig and cat 

15 were amplified from kidney-purified cDNAs. Amplifications of the human, 
monkey, rat and sheep were realized using the ATG (residues 1 to 20) and 
NCOl (residues 723-742) primers. The line under residues 41 to 56 
represents the ^ecific SPl primer used for the cloning of the pig and cat 

. ^ sequences. For these two mammals, 5' sequences for SPl are derived from 

20 the data presented in Hguie 3. Arrows indicate the start and the end of the 
codiAg sequence Q)ropq>tide aiul mature protein). 

I^ure 5« Predicted Andno Add Sequoioes of Epo nropeptide. 
The amino add sequences are derived from data obtunedfinombodigemmic 
IV1/E3C2R and cDNA anqilifications. The nmnbering oorre^nds to die 

25 human sequence. Ala 1 is die first amino add of die human mature protein. 
The asoendiiig arrows show (he site of the deavage by die agnal peptidase, 
as detenmned for die human and cynomolgus monkey proteins. 

Figure 6. AUgnment of the Frimaiy Stroctnres of mature 
mnmmflliaii Epo Froteins« The human, Macaca fascicularis and mouse 

30 ammo acid sequences were previously reported (Lin, F.-K. et al. , Proc. Natl. 
Acad. Sd. USA 52:7580-7584 (1985); Un, F.-K. eial.. Gene 44:201-209 
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(1986); McDonald, J.D. et al, Mol. & Cell Biol. 6:842-848 (1986)). The 
residue numbers correspond to the 166 aa of the mature human protein. Plain 
boxes indicate the positions of the predicted four a-helices in the human 
sequence (See Example II). Dashed boxes show the N- and 0-glycosylation 

5 sites. Limits between cxons are indicated by the vertical lines. 

Figure 6A. Schematic representation of the mammal 
erythropoietui. The invariant amino acids among the eight sequences shown 
in Figure 6 are represented by the black boxes. The localization of each of 
the four a-heliccs is underlined. The three glycosylation sites are shown as 

10 diamonds. The main disulfide bridge is indicated by the heavy line connecting 
noncontiguous sequences. The small arrows under the sequence indicate the 
short SH-bridge, missing in the rodents. 

Figures 7-7B. Riylogcnetic lineages derived from analyses of 
mammalian Epo cDNA sequences encoding 
15 the full length mature protehis from seven 

species. 

Figure 7. Strength of groups hi the maxfanmn parshnony tree 
found on examming all 945 unrooted trees formed by seven tennbial taxa. 
This tree of lowest length required 374 base substitutions. Each link between 

20 ancestral nodes has a circled number; this strength of grouping number is the 
minimum number of substitutions that must be added to the length of the 
maximum parsimony tree to find a tree diat breaks down the barrier (moves 
one pr more sequences) between the two groups sq)arated by die interior link. 
Figure 7A. Hie phylogen^c tree derived from the maxhnum 

25 patrimony reomstniction. On the basis of odier molecular evidence 
involviqg comparative amino add sequence data ftom monotremes, 
marsupials, and many eutherian qiecies (Czelusniak, J. et al., MeSi. EnzymoL 
iS?:601-615 (1990); Czelusniak, h etal. In ^Cmtvx Mammolc^gy", 
Volume 2, 3rd ed., H, H. Ganoways, pp. 545-572, Plenum PubK <3oip. 

30 (1990)) the root of this 1^ phyiogenedc tree is placed on die interior ludc 
sqnradng the artiodactyl group from die primate, rodent, cat groq>« The 
numbers cm die intemodal links rq>resent die numbers of base purs by which 
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the nodal ancestral and descendant sequences differ. MYA = millions of 
years ago, as inferred from paleontologicai views of eutherian phytogeny. 

Figure 7B. Parsimony reconslniction on that portion of the 
cDNA sequences that arc codons for amino acids. The numbers shown as 
5 a fraction on each linic are, in the numerator, the number of amino acid 
changing base replacements and, in the denominator, the number of silent base 
replacements. The computer algorithm that carried out this calculation is 
described in Czelusniak, J, et al. Nature 298:297-300 (1982) (see the legend 
for Figure 2). 

10 Figures Phylogenetic lineages derived from analyses of 

the mammalian Epo intron 1-exon 2 
sequences. 

Figure 8. Strength of groups in the maximum parsimony tree 
found on examinmg all 135,135 trees formed by nine terminal taxa. As 

15 the two fcloid sequences (those of cat and lion) arc nearly identical, they were 
treated as a single taxon in this strength of groiq)ing analyds. The maximum 
paramony tree for the scgn»nt (about 285 bp) of intron 1 and cxon 2 from ten 
species required 361 base substitutions. The cirded numbers are die strength 
of grouping numbers (as defined in die Figure 7 legend). 

20 F^mt 8A. The near most pardmonious tree that groups dog 

inthfdoidte. TUs tree reqmred 365 base substitutions. The numbers on Unks 
iq>resent numbers of base purs by wfaich tiie ix>da] ancestral and d e sce n dan t 
seqtaeaoes differ. 

F^rts Modd of tt^ Three-IXmensiaMl Snuiim of 

25 ByArojM^itin. 

Figun9. Ribbon ^ram of E^Urtiarysintetttre. Thefouro 
helices are labeled A to D; loops between helices m apprc^rialely named. 
Disulfide bridges are shown and N- and 0-glycosylation sites are indicated 
respectively by dark and dotted segments. 
30 Figure 9A. Schematic rq)resentation of ^)o's primary structure 

depicting predicted up-up-down-down orientation of the four antiparallel o 
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helices (boxes with arrowhead). This folding pattern is strongly suggested by 
the large size of the two interconnecting loops AB and CD. The limits of each 
helix were drawn according to Table 11 of Example 1. A predicted short 
region of ^shcet is delineated by the dashed rectangle. The N-glycosylation 
5 sites arc represented by the dotted diamonds, the 0-glycosylation site by the 
dashed oval. The locations of the two disulfide bridges arc shown as solid 
lines. 

Figure 9B. Cmss-secHon of the Epo molecule at the level of the 
four a helices. The helical wheel projections are viewed from the NHj-end 

10 of each helix. The hydrophobic residues, localized inside the globular 
structure, are indicated by filled circles. The charged and neutral residues 
(open and gray circles respectively) are exposed at the surface of the molecule. 

F^ure 10. Immunopredpitations of wild type Epo and the 
A140-144 mutein. Cos? cells were transfccted wth pSG5, pSG5-^/wt or 

15 pSG5-Epo/A140-144. Afterthiee days, die cells were metaboUcally labeled 
wid) I*S>med»ioiune and pSJ-cystwue. Imrounqirecipitations of cellular 
extracts and supematants were performed widi our polydonal aittibody, raised 
in rabbit agwnst die native human Bpo. The immunoptedpitaies were 
analyzed by SDS-polyactybmide gel electrq>horesis: Lanes2to4corTe^nd 

20 to cdlular extracts, and lanes 5 to 7 cotre^oiid to culture siqjermtants, ftom 
Cos? iransfonned widi die foDowiog: phundd without insert (1^ 
wild type ^ (lanes 3 and and A140-144 (lanes 4 and 7). Lane 1 
repiesenis die protdn molecular wdght standard. The two amws Aow die 
normal secretion of die wild type Epo (35-37 IcD) and dw cytoplasmic 

25 retention of die mutein A140= 144 (-28 kD). 
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Figures 11-llB. Interconnecting Loop AB. 
Figure 1 1. Schematic representation of the loop AB showing the 
localization of muteins with various deletions and amnio acid 
replacements. The dashed arrows point to the positions of the serine 

5 substitutions (in A48-52). The two N-glycosylation sites are represented by 
the gray diamonds. The small Cys29=Cys33 disulfide bridge is indicated. 

Figure JIA. Amount and biological activities of secreted muteins. 
The upper bar graphs show the relative secretion of wild type and loop AB 
muteins as determined by radioimmunoassay. The lowest bar graphs display 

10 the calculated specific acuvity (ratio bioassay/RIA) for each mutein, in 
comparison with die value obtained for the wild typt (ratio = 100%). 

Figure IIB. HCD57 cell proliferation as a function of Increadng 
concentration of vlld type and s«ine«ubstituted Epo mutdns. HCD57 
cells (lO* per ml) were cultured for three days in a 96 well microtiter plate 

IS with media conuuning inoeaaqg conoNitrations of secreted protdns 
(mU-milli-uiuts of Epo, a leladve mass measurement). The line ^lajAs show 
the cdlular growth as measured by 'H-thynu*i» vipoiit for cdls cultured with 
wild type Epo (O), Epo mutein F48S (O), Epo mutein Y49S (X), Epo mutein 
A50S (♦), Epo mutein W51S(o). and Epo mutein K52S (a). The number 

20 of viable cells was also measured with the MTTooloriffletric assay anlgaw 
similar curves. Pioiifeiation experiments usii« the himianlJT-7/Epocdll^ 
(Komatsu, N.. et al.. Cancer Ra, 5J:341-348 (1991)) and the Kiystal assay 
(Krystal. G., Exp. Hematol. 77:649-60 (1983)) produced identical results. 
Figures 12-12A. Interconnecting Lo<q> CD. 

25 Figure 12. Schematic represoitation of the loop CD diowing the 

location of three deletion muteins: A10M09, Alll-119, A122.126, and the 
insertion of seven residues after Lysll6 (myc epHope). The 0-glycosylation 
»te is indicated by die dashed oval. 

Figure 12A. Secretion and biological activities of the muteins 

30 located In loop CD. Hie two bar graphs were created as described in 
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Figure 1 lA, The two mutants All 1-1 19 and 1 16/myc were normally secreted 
and had full biological activities. 

Figures 13-13A. In Vitro Translation of the Epo Wild Type, 
Figure 13. Analysis of the ^'S-labeled translation products by 
5 SDS-PAGE. One-step transcription/translation reactions were performed in 
the SP6-TnT rabbit reticulocyte iysatc system. 1/30 of each reaction was 
resolved on a 15% polyacrylamide gel. Lane 1-low Mr standard from 
Amersham; lane 2-in vitro reaction without added plasmid; lanes 3 and 4 are 
translation products obtained after incubation of 1 /ig of circular p64T-Epo, 
10 In the presence or absence of canine pancreatic microsomal membranes, 
respectively. 

Figure 13A. Binding of the in vitro translated Epo wild type onto 
1^0 Receptor-GTS-agarose beads. 6x10^ cpm (counts per minute) of 
purified "S-labeled erythropoietin products, processed with microsomes (+) 

IS or not (-) were incubated in the presence of the extra-cytxq)lasmic domain of 
the Bpo receptor (ERE.), following the protocol described by Harris, K.W., 
etd., J. Biol. Chem. 267:15205-15209 (1992). Identical binding 
demonstrated that the conservation of the prq)q>tide did not ixapdir the 
honn(me/recq>K>r interaction. 

20 Figures 14-J4A. COOH-end of Epo. 

' Figure 14. SchematiG represatatkm of the ana^yied mntdns, 
corresponding to the ddedon of die four last amino acids A163-166 and the 
replacements of the rendues 162 to 166 by a KDEL or poly (His) sequences. 
Figure 14A. Relative secretion of these mntdns. Hie bioacdvities 

25 in the supemaiants (Krystal, G., fiqp. HmatoL J J:649-60 (1983)) and die cell 
extracts (Komatsu, N., et al., Cancer Res. 57:341-348 (1991)) of transformed 
Cos? cells were measured by in \itro proliferation assay using HCD57. More 
KDEL mutant remsuned in die cytosol of die Cos7, when compared widi die 
iRdId type Epo and A163*166 or poly (His) muteins. However, all die 

30 analyzed muteins had die same ^ecific activity as that of die \M type. 
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Figure 15. Bacterial Expression of Wild Type Epo. 
Figure 15 Panel A. Diagram of the fusion protein. An NHj- 
terminal 22 amino acid long peptide, containing a 10 histidine stretch, was 
fused to the mature erythropoietin sequence. Factor Xa cleavage allowed the 

5 recovery of the mature Epo with only two extra residues at its amino terminus. 

Figure 15 Panel B. IPTG induction of the fusion protein. 
Transformed £. coli BL21 (DE3) cultures (ODfioo^O.6) were grown in 
presence of 1 mM IPTG. Aliquots were collected at 0 (lane 2), 1 (lane 3)» 2 
(lane 4) and 3 hours (lane 5) and analyzed on a 15 % SDS-polyacrylamide gel, 

10 stained with Coomassie Brilliant Blue. High level production of the fusion 
protein was rapidly obtained. Lane 6 corresponds to an aliquot from 
transformed bacteria grown for 3 hours in a medium without IPTG. Lane 1 
is a low molecular weight standard. 

Figure 15 Panel C. Purification of the fusion protdn. After 3 

15 hours of IPTG induction, the produced (His)io-Epo was solubilized in 6 M 
guanidine-HCl and purified on a nidccl affini^r resin by increasing imidazole 
concentrations following the pET-His system protocol (Novagen). Samples of 
the column eluants were analyzed by SDS-PAGE and stained with Coomassie 
Brilliant Blue. Lane 1-clution by 20 mM imidazole; lane 2-eluiion by 100 

20 mM imidazole, releasiiig the fusion protein; lane 3-chelation of the nickel by 
a 100 mM EDTA wash; lane 4-nK)lecular weight standard. 

Figure 15 Panel D. Detection of the E. coU recombuiant Epo on a 
Western blot. Solubilized proteins were sq)ara!ed on a 15% SDS- 
polyaoylamide gel, transferred to a nitrocellulose membrane and probed widi 

25 a 1/2000 dilution of our native wild type polyclonal antibody, as described 
under Materials and Metiiods. Lane 1-analysis after oxidative reduction; 
lane 2-after dialyas against the factor Xa buffer; and lane 3-after factor Xa 
cleavage* 

Figure 16. Relationsbip Between Production of Muteins and 
30 PnqMised Secondary Structure. This bar grq>h shows the amount of 
secreted proteins in die supematants of transiently expressed Bpo mutants, as 
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10 



detected by radioimmunoassay. The muteins were aligned over a schematic 
representation of ihe native Epo molecule. Each deletion is shown as a 
stippled bar. the width of which is proportional to the number of residues 
deleted. The four a helices are represented by the black rectangles. The two 
disulfide bridges are indicated. These mutagenesis results are in good 
agreement with our proposed four o-helical. model of Epo. 

Figure 17. Biological Activity of Helix A MUteins. The top panel 
shows a helical wheel projection of Epo helix A witii arrows indicating the 
positions where modifications were made to create muteins. The lower panel 
is a dose response curve representing the biological activity of wild type Epo 
(0) and helix A muteins E13A (♦ ). RMA (D), E18A (0), K20A (•*-). and 
E21 A (□) over a range of dosages (X-axis; mU =milli-units of Epo. a relative 
mass measurement) as determined by the relative incoiporation of fH]- 
thymidine measured in counts per minute (cpm) (Y-axis) into Epo dependent 

15 HCD57 cells. 

Figure 18. Biological Activity of Helix D Mutdas. The top panel 
shows a helical wheel projection of Epo hclfai D with arrows indicating the 
positions where modifications were made to create muteins. The lower panel 
' is a dose response curve representing die biological activity over a range of 

20 dosages (X-axis) of wild type Epo (b) and heKx D mtJteins D136A ( ♦ ), 
R139A (P). K140A (O) and 143A (x) as determined by the relative 
incorporation of [»H]-tiiymidine (Y-axis) into Epo dependent HCD57 oeUs. 

Figure 19. Biological Activity of Muteins A^jacwit to HeBx D. 
The top panel shows die region adjacent to die C-terminal boundary of Epo 

25 helix D with arrows indicating die positions where modifications were made 
to create muteins. The lower panel is a dose response curve representing tiie 
biological activity over a range of dosages (X-axis) of wild type Epo (0) and 
muteins adjacent to helix D K152A (♦). K154A (D). and Y156A (O) as 
determined by die relative incoiporation of pH]-tiiymidine (Y-axis) into Epo 

30 dependent HCDS7 cells. 
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Figure 20. Comparison of Muteins With Increased Biological 
Activity. A dose response curve representing the biological activity of wild 
type Epo (-*-), and muteins R143A (□) and K154A (♦) with increased 
biological activity relative to wild type Epo is shown over a range of dosages 
5 (X-axis) as determined by the relative incorporation of pH)-tiiymidine (Y-axis) 
into Epo dependent HCDS7 cells, n. 

Figure 2L Comparison of Muteins With Increased Biological 
Activity in Non-malignant Cells. A dose response curve representing the 
biological activity of wild type Epo (-x-), and muteins R143A (□) and K1S4A 

10 ( ♦ ) witi) increased biological activity relative to wild type J^, is shown over 
a range of dosages (X-axis) as determined by the relative incorporation of 
pH]-tiiymidine (Y-axis) into murine spleen cells. 

Figure 22. Comparison of Muteins With Increased Biological 
Activity in a Human Cell Line. A dose response curve rq)resenting the 

IS biological activity of wild type Epo (x), and muteins R143A (□), and K1S4A 
(♦) mih increased biological activiQf (relative to wild type Epo) is shown 
over a range of dosages (X-axis) as determined by the relative incorporation 
of pHl-diymidine (Y-axis) into human Epo^q>endent UT-l/Bpo cell line 
. (Komatsu, N. et al.. Cancer Res. 51: 341-348 (1991). 

20 l^ure 23. Comparison of Mutdns irfth Increased Biological 

kc&nXs in a Human Cell Iine« A dose re^onse curve rqpresenting the 
biological activity of v^ld-type ^ (Q) and mutein N147A ( ♦ ) witii increased 
bioipgical activity (relative to wild-type Epo) is shown over a range of dosages 
(mU) (X^is) as determined by tiie relative incorporation of pH)-tiiymidine 

75 (Y-axis) into human ^)o^ependent UT-7/1^ cell line. 

Figure 24. Comparison of Muteins with Increased Biological 
Activity in a Human CeD Line. A dose re^nse curve representing the 
bidogical activity of wild-^ Epo (Q) and mutein N147A ( ♦ ) widi increased 
biological activity (relative to wild-^ Epo) is shown over a range of dosages 
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(mU) (X-axis) as determined by the relative incoiporation of pH]-thymidine 
(Y-axis) into human Epo-dependent UT-7/Epo cell line. 

Figure 25. Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 

5 biological activity of wild-type Epo (Q) and mutein N147A (♦ .D) with 
increased biological activity (relative to wild-type Epo) is shown over a range 
of dosages (mU) (X-axis) as deteimined by the relative incoipoiation of pH]- 
thymidine (Y-axis) into human Epo-dcpendcnt UT-7/Epo cell line. 
Biological Activities of Helix A Muteins. 

10 Ftgun 26. Comparison of Muteins with Increased Biological 

Activity in a Human CeU Line. A dose response curve representing the 
biological activity of wild-type Epo (0) and mutein K20A with increased 
biological activity relative to die wild type Epo, is shown over a range of 
dosages (X-axis) as determined by die relative incorporation of pH)-diymidine 

15 (Y-axis) into UT-7 ^ cells. 

Figure 27. Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing die 
biological activity of wiW-typc Epo (0) and mutein Y49S widi increased 
biological activity relative to die wild type is shown over a raqge of 

20 dosages (X-axis) as determined by die relative inooipoiation of pHJ-diymidine 

(Y-axis) into UT-7 Epo cells. 

Biological Activities of HeOx B Muteins. 

J^urt 28. Comparison of Mutdns wifli Increased Blotog^l 

Activity fa a Human Cell Lfae, A dose response curve representing die 
25 biological activity of wild-type Epo (0) and mutein A73G witii incieased 

biological activity relative to die wild type I^, is shown over a range of 

dosages (X-axis) as determined by die relative incoiporation of pHJ-diymidine 

(Y-axis) into UT-7 Epo cells. 

Biolo^ Activities of Helix D Mutdns. 
30 Figun 29. Comparison of Muteins witii Increased Biological 

Activity in a Human Cell Une. A dose reqwnse curve iqiiesenliiig die 
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biological activity of wild-type Epo (o) and mutein K140A with increased 
biological activity relative to the wild type Epo. is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of ['H]-thymidine 
(Y-axis) into UT-7 Epo cells. 

5 Figure 30a. Comparison of Muteins with Increased Biological 

Activity in a Human Cell lint. A dose response curve rq>resenting the 
biological activity of wild-type Epo (b) and mutein R143A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incoipotation of pHJ-thymidinc 

10 (Y-axis) into UT-7 Epo cells. 

F^un 3m>. Comparison of Mutdns with Increased Biological 
Activity in a Mouse Cell Une. A dose response curve representing the 
biological activity of wild-type Epo (0) and mutein R143A with increased 
biological activity relative to the wild type Epo. is shown over a range of 

15 dosages (X-axis) as determined by the relative incoipoiation of pH>Aynudine 
(Y-axis) into HCD 57 cells. 

Figure 31. Comparison of Mutdns with Increased Biological 
Activity in a Human Cell Line. A dose rei^XMise curve iq)resenting the 
biological activity of wild-type Bpo (B) and mutein S146A wth increased 

20 biological activity relative to the wild type Bpo, is shown over a range of 
dosages (X-axis) as determined by the relative incoiporadon of pHI-diymidine 
(Y-axis) into UT-7 Epo celb. 

Figun 32. Comparison of Matdns with Increased Biological 
Activity in a Human Cell Line. A dose re^nse curve rq>resentii% die 

25 biological activity of wild-^ Epo (B) and mutein N147A wiA increased 
bicriogical activity relative to the wild type Epo, is shown over a range of 
dosages (X-«xis) as determined by the relative incoipotation of pH]-thymidine 
(Y-axis) into UT-7 Epo cells. 

Figun 33a. Comparisan of Mutdns with Increased Biological 

30 Acti^ in a Hmnaii Cdl Line. A dose re^onse curve rq)reseiitlng die 
bioio^cal doanty of wild-^pe Epo (Q) and mu^n K154A widi increased 
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biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of I^H]-thymidine 
(Y-axis) into UT-7 Epo cells. 

Hgure 33b. Comparison of Muteins with Increased Biological 

5 Activity in a Mouse Cell line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein K154A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of l^H]-tiiymidine 
(Y-axis) into HCD 57 cells. 

10 Figure 33c. Comparison of Muteins with Increased Biological 

Activity in Non-Malignant Mouse Cells. A dose response curve representing 
tiie biological activity of wild-type Epo (B) and mutein K154A witfi increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by tfie relative incorporation of pH]-tiiymidine 

IS (Y-axis) into primaiy murine ^leen cells. 

Figure 34. BloactMty of ^Replacmient Muteins. The letters in 
the column on the left designate predicted helices (A, B. C, D) or interfaelical 
loops (A-B. C-D). The 3'D designation (3' of D) in the table represents 
anuno acids 153-166 i^rtiich arc at the carboxy-cnd of the DJiclix. The amino 

20 add of vM type human Epo, its position from the N-termimis, and its 
replacement according to the single anuno add fetter code, are listed under tte 
mutein column, e.g. S9A, r^resents a mutation from the wiU-Qrpe human 
Epo at the anuno add serine of portion number 9 to alanine. The three 
bioassay columns (human UT7 cell line, murine HCDS7 cell line, and primary 

25 mouse ^leen erytiuoid cells) show q)ecific bioactivity of each mutein, 
expressed as a percentage of wild type human Epo bioactivity, witii tiie 
background COS supernatant alone subtracted from tiie value. The mean and 
standard deviation of tiie bioassay values are listed for n^3 determinations. 
The numbers widiin tiie brackets indicate tiie number of separate COS cell 

30 tiansfections over (/) die number of sq)arate bioassays tiiat were performed. 
NS: not secreted from COS 7 cells. 
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Definitions 

In order lo provide a clearer and more consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. 

5 Administratbn. The term "administration" is meant to include 

introduction of the Epo muteins of the invention into an animal or human by 
any q)propriate means known to the medical or veterinary an, including, but 
not limited to, injection, oral, enteral, and parenteral (e.g., intravenous) 
adminisuation. 

10 Amino Acid Codes. The most common amino acids and their codes arc 

described in the foUowiqg table: 
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Table 1 
Amino acid names and codes 



Amino acid 


Single letter 

code 


1 nrcc icuci 
code 


Alanine 


A 
A 


Ala 


Arginine 


D 

l\ • 


Arc 


Aspartic acid 


U 


Acn 

/up 


Asparagine 


N 


A cn 


Cysteine 


C 


Pvc 


Glutamic acid 


i: 


VJJU 


Glutamine 


w 


Gin 


Glycine 




Glv 
uiy 


Histidine 


n 


His 


Isoleucine 


1 
1 


He 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Set'wt 


S 


Ser 


Threonine 


T 


Thr 


Tryptophane 


W 


Tip 


Tyrosine 


Y 


iy 


Valine 


V 


Val 



Amnud. The lenn "ammal" is meant to include all ammals whose 
25 pnxluction of red blood cells (eiyduocytes) Is dependent upon, or stimulated 
by. eiyduopoietin (Japo). Foremost among such animals are humans; 
however, the invention is not intended lo be so limiting, it being widiin die 
contenvladon of the prcscm invention to neat any and all animals which may 
experience the beneficial effect of die Epo muteins of the invention. 
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Efficacious Amount. An "efficacious amount" of an Epo mutein of the 
invention is an amount of such Epo mutein that is sufficient to bring about a 
desired result, particularly the stimulation of red blood cell production, 
especially upon administration to an animal or human. 
5 Bythmcyte. The term "erythrocyte" is Intended to refer to a ted blood 

cell. 

Erythroid Pncunor Cells. By "erythroid precursor cells" is intended 
cells with a capability to proliferate and differentiate into erythrocytes that is 
dependent upon exposure or contact with Epo. 

10 Etyihmpoktin (Epo). The term "erythropoietin", or "Epo" is 

primarily intended to refer to the mature form of this hormone. The terms 
"wild type Epo" and "native Epo" are used interchangeably w refer to In 
its naturally occurring, unmodified form. Unless otherwise indicated, amino 
acid positions on the Epo protein referred to herein are made with reference 

15 to the mature. 166 amino add human Epo sequence shown in Figure 6 and 
corresponding sequences from other species. 

Host CeU. By "host" or "host cell" is intended the cell in which a gene 
encoding an Epo mutein of the invention is incorporated and expressed. An 
' 1^ mutein gene of the invention may be introduced into a host cell as part 

20 of a vector by transformation. 

. Mutein. By "mutein" is meam a mutant or modified protein wth one 
or more modifications to its nadve amino acid sequence in the form of anuno 
ac^ additions, deletions, or substitutions. An eiytiiropoietin mutein refers to 
a modified eiythraixHetin pioidn. 

25 Phamaeeutiedfy AeeeptabU VMete. The term "pharmaceudcally 

accqitable vehicle" is intended to include solvents, carriers, diluents, and tiie 
like, wluch are utilized as additives to preparations of die 1^ muteins of tiie 
invention so as to provide a carrier or adjuvant for die adnunistration of sudi 
1^ muirins. 

30 Trmsfomadon, By'transfbnnation'islntendedtheactofcananga 

host odi to contain a deaved nucldc acid moleode, including dther a native 
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Epo gene or a gene encoding an Epo mutein of the invention, not originally 
part of that cell using methods known in the art. 

Transfecdon. By "tiansfection" is intended the introduction of a DNA 
or RNA vector carrying a desired nucleic acid molecule, including either a 
5 native Epo gene or a gene encoding an Epo mutein of the invention, into a 
host cell. 

Vector. By "vector" is intended a DNA element used as a vehicle for 
cloning or expressing a desired gene, such as an Epo mutein gene of the 
invention, in a host. 

10 Detailed Description of the Preferred Embodiments 

This invention is drawn to modifications of Epo which enhance its 
biological activity. Due to their enhanced activity, the Epo muteins of the 
invention provide a preferable alternative to native ^ where it is used to 
stimulate red blood cell production (e.g. treatment of blood disorders, cell 

IS culture, etc.). 

The present invention teadies a structural model of the Epo protein 
useful for identifying functional regions whidi may be modified to enhance 
biological activity. According to die invention, Epo is predicted to have a 
four anti-parallel amphipatfaic a-bdical bundle structure. Based iqxm diis 

20 predicted structure, fui^tionally important regions predicted to reside on Ae 
external surfaces of the helices (designated A-D. see Figures 9A-9B) arc 
identified. In particular, tiie external surfeces of helices A and D, predicted 
to be involved in ^ recq>tor binding, are identified. According to the 
invention, these functionally important regions serve as targets for 

25 modifications which may enhance biological activity. Increased activity 
relative to the wild-iype ^ can be found witii amino acid substitutions in the 
A,B,C or D helix. 

Aooordiiig to one zsp&A of die invention, 1^ muteins are provided in 
which one or more of die anuno adds widun tlie predicted ^eternal surfaces 
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of helices A, B, C, D and 3' of D, or within the region immediately adjacent 
to the C-terminal end of helix D, have been replaced. In particular, amino 
acid substitutions at positions 20, 49, 73. 140, 143, 146. 147 and 154 of wild 
type Epo have been replaced. The inventors have discovered that substitution 

5 of the amino acids normally occurring at these positions results in Epo muteins 
with significantly higher biological activity than wild type Epo proteins. 

According to the invention, the amino acids at position 20 (typically 
alanine), 49 (typically serine), 73 (typically glycine). 140 (typically lysine), 
position 143 (typically arginine), position 146 (typically serine), position 147 

10 (typically asparaginc), and 154 (typically cither lysine or threonine) of wild 
type Epo are preferably replaced with an alanine residue. In addition to 
alanine other amino acids with a chemical structure and properties similar to 
alanine, such as serine and threonine, are contemplated by the invention as 
suitable substitutes for achieving Epo muteins of Uie invention with enhanced 

15 biological activiQr. 

In another aspect of the invention, a portion of flic predicted inteiloop 
region between helices A and B extending from amino add 48-52 is identified 
as important to Epo function. According to tiie invention, diis functionally 
important region serves as another target for modifications which may enhance 

20 biological activity. The inventors have discovered diat substitution of die 
amino acid at poation 49 results in Epo mutwiis wifli significamly higher 
biological activity tiian wild type Bpo proteins. 

According to die invention, die amino acid at position 49 (typically 
tyrosine) of wild type 1^ is preferably replaced widi a serine residue. Odier 

25 amino acids wdi a diemical structure and properties similar to serine, such 
as alanine and direonine, are contemplated as suitable substitutes for achie^ng 
Bpo muteins of die invention widi enhanced biological activity. 

The modifications taught by die present invention occur in regions diat 
are highly conserved among Epo proteins from evolutionarily divergent ^ies 

30 (see Example I and Figure 6 in particular). These modifications are dius 
txpMtd to be broadly iq>plicable to Epo protdns whidi are substantially 
similar to die human Epo protein in die regions where die modifications oqcur, 
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including, but not limited to, Epo proteins from monkey, mouse, rat. sheep, 

pig, cat, and dog. 

Biological activity of Epo muteins can be determined by assaying the 

ability of these proteins to stimulate the proliferation of Epo responsive cells 
5 such as murine spleen cells (Krystal. G.. Exp. Hematol. Ji:649-60 (1983); 

Goldberg. M.A., etcL. Froc. Natl. Acad. Sd. USA 84:im-im (1987)). 

murine Epo responsive murine erythroleukemia cells (Hankins, W.D., et al., ■ 

Blood 70:173a (1987)). and human Epo^cpendent UT-7/Epo cells (Komaisu, 

N.. et al.. Cancer Res. 57:341-348 (1991)). 
10 When assaying for biological activity, it is important to recognize that, 

as taught by the present invention, high levels of biologically active Epo can 

cause terminal maturation and cessation of division of Epo lesponsive cells. 

Because of this possibly confounding phenomenon, it is importam to determine 

biological activity based on a full dose response curve over a broad range of 
15 dosages which includes non-saturating levels of the parUcular 1^ mutdn 

being assayed. 

Although the mechanism by which the modificaiions taught by the 
invention increase biological activity is not completely understood, one 
possibility is that these modifications enhance recq)tor binding affimty. This 

20 possibility is Aought to i^ply partolariy to the modifications at portions 20, 
49. 73. 140. 143. 146. 147, and 154. which lie within a legion predioed to 
be involved in 1^ recq>tor binding according lo die teachings <rf the present 
Invention. 

Genes Encoding Epo muteins (Epo mutein genes) 

25 In addition to tiie ^ muteins themselves, genes encoding tiiese 

muteins are also contenq)latBd by tiie present invention. Genes encoding tiie 
Epo muteins of tiie invention may be made by modification of tiie native ^ 
gene uang standard mediods well known to diose of skill in tiie art 

The native ^ goie may be obtained mv% a variety of standard 

30 techniques. For example, a DNA molecule corresponding to die known 
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sequence of the Epo gene from a number of species may be artificially 
synthesized (See Example I and Figure 4). The Epo gene may also be isolated 
from a cDNA or genomic library using nucleic acid probes based on available 
Epo DNA sequence information and standard hybridization techniques as 

5 taught in U.S. Pat. No. 4,703.008. issued Oct. 27. 1987. which is herein 
incorporated by reference (See also Jacobs et al.. Nature 313: 806-810 
(1985)). Alternatively, amplification of Epo gene sequences by polymerase 
chain reaction (PCR) may be accomplished uang primers based on available 
sequence data as taught in Example I. 

10 Once obtained, the wild type Epo coding sequence can be altered to 

encode an Epo mutein of the invention using standard oligonucleotide-directed 
in vitro mutagenesis techniques (See ZoHer, M. and Smith. M., Methods in 
Enzymology 100: 468-500 (1983); Dente. L. etal, Nucl. Adds Res. 11: 1645 
(1983): Kunkel. T.A.. et al.. Meth. Eiaymol. /5^:367-382 (1987); Vandcyar. 

15 M. et al.. Gene 65: 129 (1988)). These techniques generally involve 
hybridization of an oligonucleotide caning the denied alteration to a ai^ 
stranded target DNA. This is followed by complementary strand synthesis of 
the target DNA which incoipoiates the oligonucleotide cwrying the defied 
. alteration. The complemcntaiy strand containing the desired alteration is then 

20 propagated in a bacterial host cell. Spedfic plication of tiiis typt of 
tedmiqiK to modify die Epo gene is described in Exan^les II and in and in 
Australian Patem Publication No. AU-A.59145A90 by Fibi «r «/. and in U.S. 
Pat* No. 4,835.260 by Shoemaker, botii of which are herein incorporated by 
reference. 

25 Expmdon of Epo mutein Genes 

Genes racoding Epo mutdns of die invention can be introduced and 
e3q)iessed in a selected host cell system udng conventional materials and 
techniques. DNA elements sudi as promoters, enhancers, polyadeiqrlation 
sites, nansoiption termination signals, and tiie like should be assodated mtii 
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the Epo mutein coding sequence so as to promote and control expression of 
the Epo mutein. The specific regulatory elements used will depend upon the 
host cell system selected for expression, whether secretion of the protein is 
desired, and other considerations readily apparem to one of skill in the art. 
5 Various vectors may be employed as vehicles for the introduction and 

expression of Epo mutein genes in a host cell. Such vectors useful in ti.e 
different host cell types are well known and include, for example, the • 
mammalian expression vectors pSG5 (Stratagene). p-RKl (Genetics Institute). 
p.SVK3 (Pharmacia). p-EUK-Cl (Clontech). pCDM (Invitiogen). pc DNAI 
10 (Invitrogen), and the bacterial expression vectors pFLAG-1 (IBl). all pET 
system plasmids (Novagen). pTrcHis (Invitrogen). the pGEX series 
(Pharmacia), and pKK 233-2 (Clontech). These vectors may be maintained 
as episomes in the host cell or tiiey may fecilitatc integration of the Epo 
mutein gene into tiie host cell genome, or bofli. Vectors may also include 
15 oUicr useful features, such as genes which allow for the seleaion or detection 
of cells in which they have been successfully introduced. 

Host celb suitable for expression of the Epo muteins of the invention 
include, but are not necessarily limited to, E. aU, yeast, insect cells, plant 
' cells, and a variety of mammalian cell types, including in particular Chinese 

Hamster Ovary (CHO) cells. Cos? cells. Cosl cells, baby hamster kidney 
cell$. and CVl celb. Epo mutein genes can be introduced into a host cell 
uaiig standard transformation or transfection techniques. 

■ Host cells which provide for glycosylation are piefiBired for production 
of Epo muteins to be used in m w since the caibohydratt sttucture of ^ is 
25 important for optimal biological activity in viw Doidal, M.S. et aL. 
Endocrinobgy 116(6)' 2293-2299 (1985)). 

The Epo muteins of d» invention may be recovered and purified from 
host cells in which diey are expressed using known metijods such as 
immunoaffinity chromatography witit antibodies to human Epo. 



20 
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Therapeutic Use of Epo muteins ^ 
of the Invention 

The Epo muieins of the invention are suitable for therapeutic use in 
animals, particularly humans, and may be used in the same manner as wild 

5 type Epo, except that lower dosages will be required to achieve the same level 
of biological activity. 

Administration of the Epo muteins of the invention may be 
accomplished by any of the methods known to the skilled artisan, provided 
that the method effectively places the Epo mutein in the apprq)riate 

10 environment to exhibit its activity (i.e. in contact with crythroid precursor 
cells). A preferred method is by parenteral routes, including intravenous and 
subcutaneous administration. 

Administration will ordinarily include an efficacious amount of die Epo 
mutein supplied in a pharmaceuucally acoq)table vehicle. The amount of 1^ 

IS mutein whidi is efficacious will be determined by a trained professional 
dq)endiQg upon a number of oonsideiations, including tbe condition being 
treated and its severity, the sex and body weight of the subject beiiig treated, 
the metiiod of administration, and the presence of compositions in the 
pharmaceutically acceptable vehicle whldi affea Epo acti^ty. 

20 . In aiv event, die efficacious amount of a 1^ mutein as taught by die 
invention will be significandy less dian die conesponcUiig amount of 
unmodified Epo needed to achieve die same degree of efficacy. The 
difference between die efficacious amount of die 1^ mutein and die 
analogous amount of unmodified Epo will corre^nd widi die increased 

25 biological activity exhibited by die Epo mutein. It is contemplated diat an 
efficacious amoum of an Epo mutein of die invention will be as low as 10-fold 
less dian native Epo, or as low as about 5-10 units/kg body weight for treating 
chronic renal failure patients or about 10-30 units/kg body weight for treating 
HIV-infected patients \ridi serum Epo levels less dian or equal to 500 mU/ml 
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(milliuniis/milliliter) who are receiving a dose of AZT equal to. or less than. 
4200 mg/week. 

Since the Epo muteins of the invention can be used effectively at lower 
dosages rclaavc to wild type Epo, their use may reduce potential adverse 

5 effects associated with administration of Epo such as exacerbation of 
hypertension, seizures, headaches, tachycardia, nausea, clotted vascular 
access, shortness of bitath. hyperkalemia, and diarrhea (see PDR. 1993 
edition, at pp. 603-604). 

Further description of methods and materials useful in the recombinant 

10 production of the Epo muteins of the invention and their use is provided in the 
teachings of U.S. Patent No! 4,835.260 by Shoemaker and U.S. Patent 
No. 4,703.008 by Un, both of which are herein incorporated by reference in 
their entirety. 

The invention may be better understood and appreciated by reference 
15 to the following examples. These examples arc provided merely for 
illustrative purposes and should not be considered in any way as limiting the 
scope of the clsdmed invention. 

EXAMPLE I 

Erythropoietitt Stmetun-Funetion RdaHonslUps: 
20 Bigh Degree of Se^ieace Ibmobtgy Amag Mammals 

Summaiy 

In Older to investigate structure-fiinction relatioiisl^s of oyflmipm 
(Epo). wc have obtained cDNA sequences that eiioode the mamre E|» protd 
of a variety of mammals. A first set of primers, coircsponding to conserved 
25 nucleotide sequences between mouse and human DNAs, allowed us 10 amplify 
by PGR intion 1-oton 2 fragments ftom genomic DNA of hamster, cat, lion, 
dog, horse, sheep, dolphin and pig. Sequencing of tiiese fragments permitted 
the design of a second generation of qjecies-^ific primers. RNA was 
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prepared from anemic kidneys and reverse-transcribed. Using our battery of 
species-specific 5' primeis. we were able to successfully PGR amplify Epo 
cDNA from Rhesus monkey, rat. sheep, cat and pig. Deduced amino acid 
sequences of mature Epo proteins from these animals, in combination with 

5 known sequences for human. Cynomoigus monkey and mouse, showed a high 
degree of homology, which explains the biological and immunological cross 
reactivity that has been observed in a number- of species. Hunan Epo is 91 % • 
identical to monkey Epo, 85% to cat Epo and 80-82% to pig. sheep, mouse 
and rat Epos. There was full conservation of a) the disulfide bridge linking 

10 the NHj and COOH termini; b) N-glycosylation sites; and c) predicted 
amphipathic a-helices. In contrast, the short disulfide bridge (a9/C33 in 
human) is not invariant. Cys33 was replaced by a Pro in rodents. Most of 
the amino acid replacements were conservative. The C-terminal part of the 
loop between the C and D helices showed the most variation, with several 

15 amino acid substitutions, deletions and/or insertions. Calculations of 
maximum parsimony for intron 1/exon 2 sequences as well as coding 
sequences enabled the construction of cladograms tiiat are in good agreement 
widi known phylogenetic relationships. 

INTRODUCnON ^ 

20 A laije number ofeariy physiologic studies have established exttM 

though not complete, biologual cioss-reactiviiy between Epos of man and a 
number of other mammals including mouse, rai. sheep and rabbit (Jelkmann. 
W„ fftyrio/. Rev. 72:449 (1992); Garcia, J.F. ef aL, Proc. Soc Exp. BioL 
Med. 172:712-714 (19fi2): Zanjani. E.D. etal., Proc. Soc Exp. Biol. 

25 Med. J3J: 1095-1098 (1969); Biljanovic-Paunovic. L etal.. Period. BioL 
90:421-427 (1988)). In contrast, non-mammalian vertd)ratBS (anqiiubians 
(Rosse, W.F.. ef al., Blood22'.(M2 (1963); Pinder, k.etaLJ. Exp. Bbl. 
iQ5:2Q5.2l3 (1983)), biids (Rosse, W. etal.. Blood 27:654^1 (1966); 
Black, CP. et al., Respirat. Physiology 59:217-239 (1980)), fish (Zanjani. 
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E.D., et al.. Science 33:513 (1969)) have erythropoietic hormones that fail to 
cross-react with mammalian erythroid cells, and vice-versa (Rosse. W.F., 
et ai, Blood 22:66-72 (1963); Finder. A.etal., J. Exp. Biol. 105:205-213 
(1983); Rosse, W. et al.. Blood 27:654-661 (1966)). 
5 Thus far, the Epo genes of man (Jacobs, K.etel., Nature 5/5:806-10 

(1985) ; Lin. F.-K. et al., Proc. Natl. Acad. Sci. USA 82:7580-7584 (1985)), 
a monkey (Macacafasdcularis) (Un. F.-K. et al.. Gene ¥4:201-209 (1986)) • 
and a rodent, the mouse (McDonald^ J.D, et ah, Mol. & Cell Biol. «:842-848 

(1986) : Shoemaker. C.B. etai, Mol. & CeUBiol. 6:849-858 (1986)) have 
10 been cloned, sequenced, and expressed. In view of the marked crossr 

reactivity between mammalian Epos, it is not surprising that there is a high 
degree of sequence homology in the coding region of the mature secreted 
proteins. In keeping with their close phylogenetic relationship, human and 
monkey ^s are 94% and 91% identical in nucleotide and amino acid 

15 sequence respectively. In contrast, human and mouse Epos are 76% identical 
in nucleotide sequence and 80% identical in amino acid sequence. 

Direct information on the tiiree-dimensional structure of Epo is not yet 
available. Insights into structure-function relationships of Epo can be gained 
from the analyses of a more complete set of animal sequences. Such 

20 information could be usefiil for sequence-based conq)Uter modeling of three- 
dimcnaonal strucmre. Moreover, a larger data base would permit the 
identification of highly conserved domains titat arc likely to be crucial to 
folding and/or biological function. Finally, comparative amino acid and 
nucleotide sequence information on Epo provides additional data for 

25 investigating phylogenetic relationships. 

In this Example, die use of PGR based tediniques for obtaining tiie full 
coding sequences of Epos of Rhesus monkey, rat, sheq). pig and cat, as well 
as partial sequences from four otiier species is demonstrated. 
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Materials and Methods 

Animal Sampks. Human hepatoma Hep3B cell line and multiple 
kidney-derived cell lines from hamster (BHK), sheep (MDOK), pig (LLC- 
PKl). dog (MDCK). cat (CRF-K). lion (PALl-K) and spotted dolphin (SPl-K) 

5 were obtained through the American Type Culture Collection (American Type 
Culwre Collection. 12301 Parklawn Drive. Rockville. MD 20852).- 
Monolayer cells were grown in 100 x 20 mm tissue culture dishes using 
recommended media and maintained in a humidified 5% COj/SS^ air 
incubator at VC. In some experiments, cells were made hypoxic by 

10 overnight incubation in 1 % 0,, 5 % COj, 9456 N, at 37 "C. 

Poly A+ RNA prepared from Rhesus monkey kidney was purchased 
from Clontech (Palo Alto. CA). Kidneys from anemic cat and horse were 
obtained from veterinarians at Tufts School of Veterinary Medicine and the 
University of Nebraska, respectively. Anemia was induced by ti>ree 

15 consecutive d^ly intr^ritoneal injections of phenylhydrazine (60 mg/kg body 
weight) into male Sprague-Dawley rats (SD-strain) and by repeated bleeding 
of the sheep and the pig. Kidneys were asq)tically removed after induction 
and stored immediately in liquid nitrogen. 

A 4 kb genomic human Epo done (g Bpo4) was provided by Genetics 

20 Institute (Boston, MA). 

DNA and RNA Fr^ara&ons. DNAs from cell lines or homogenized 
kidneys were extiacted using pancreatic RNAase/SDS/proteinase K following 
a procedure modified from Blin and Stafford (Blin, N. «r a/.. Nucleic Adds 

to. 3:2303-2308 (1976)). 
25 Kidn^ were homogeiuzed in 4 M guaiudine isothiocyanate (10 ml/g 

of frozen organ), containing 0.1 M beta-mocaptoethanol and 10% N-lauiyl- 
' satcoane. Total RNAs were isolated by centrifugation over 5.7 M CsCl 
(Chirgwin. J.M. etal., Biochemstry i&5294 (1979)). After edianol 
piedpitation. the samples were nesu^nded in diethyl pyrocarbonate-treaied 
30 water and stored at -70°C. Confluent monolayer cells were washed twice with 
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sterile phosphate buffered saline and directly lysed in the 4 M guanidine 
isothiocyanate solution. Total RNAs were isolated as described above for the 
kidneys. 

RNA samples were first converted into single stiand cDNA. 2 to 4 ;.g 
5 of total RNA from kidney or 500 ng to 1 fig of total RNA from cultured cells 
were denatured at 68'C in the presence of 2 of oligo dT,,,,. Reverse 
transcription was carried out in a 20 ^1 final volume, containing 50 mM Tris- • 
HCi pH 8.3. 6 mM MgCK, 40 mM KCl. 10 mM DTT. 500 f»M dNTPs. 20 
u RNAsin (Promega. Madison. WI) and 20 u of avian myeloblastosis vims 
10 reverse transcriptase (AMV-SuperRT, Molecular Genetic Resources. Tampa. 
FL). The reaction was allowed to proceed at 3rC for 20 minutes, then 
at 42°C for one hour. Inactivation of the enzyme was performed at lOO'C 
for 10 minutes. cDNA samples were stored at -lO'C. 

Polymerase Chain Reaction. 20 to 100 ng of gcnomic-DNA or 0.5 
15 to Ijil of the RT-reaction were used as templates for amplification. 

Standard PCR reactions were performed in a 100 ii\ volume 
containing 50 mM Tris-HCl pH 8.3, 50 mM KCl, 1.5 mM MgCl,. 0.01 % w/v 
, gelatin, 200 iM dNTPs, 30 pM of each sense (5') and amisense (3') primers 
and 0.5 units of Taq DNA polymerase (Perkin Elmer Cetus, Emeryville. CA). 
20 After an initial denatutation step for 5 minutes at 95»C. routinely 35 cycles 
of PCR were perfonned on the DNA Thermal Cycler (Perkin Hmer Cetus). 
The denatutation step of each cydc was carried out at 94»C for Imin. Fbr 

each sample and/or the primer pair used, annealing temperatures were 
optimized (from 45*C to 60'Q for a 2-mimite reaction. A 3-minute 

25 elongation siq> at 72*C ended eat* cyde. 

Amplification products were analyzed on 1 to 3% agarose gd-TAE. 
Nu Sieve or Seapteque agaroses (FMC. Roddand. ME) were used for 
preparative purposes. Specific PCR products were recovered from the gd, by 
the use of the Gene aean n kit (Bio 101. U Jdla. CA). by Spin X 

30 centrifugation (Costar. Cambridge. MA) or by standard phcnol/dilorofiorm 
extractions of mdted gel slices. 
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Subcloning and Sequencing. Small (250-330 bp) amplified genomic 
fragments were directly subjected to automated sequencing, using the PCR 
primers as DNA sequencing primers. 

PCR-generated cDNA products were blunt-ended using Klenow 

5 fragraem and 5' phosphorylated by T4 DNA kinase (Boehringer Mannheim, 
Indianapolis, IN). They were then cloned into the Smal site of the phagcmid 
pBluescript II SK (Stratagene, U Jolla. CA). The double-stranded plasmid" 
was sequenced using the SK and KS sequencing primers. 

In all cases. Sequencing reactions were carried out on an Applied 

10 Biosystems 373A Automated DNA Sequencer utilizing ABI's DyeDeoxy 
Terminator Sequencer kit (Foster City. CA) and thermal cycling with Taq 
DNA polymerase (Promega. Madison, WI) as previously described (Tracy, 
T.E. et al., Biotechniques 77:68-74 (1991)). To avoid errors, for each 
species, samples derived from different amplification/cloning experiments were 

IS piq)ared and both strands were sequenced. 

MammaUan Expmsba; Bioaedvity. Vectors containing full-length 
mammalian Bpo codng sequences were subcloned Into pSG5 plasmid 
(Pharmacia, Piscaaway, NJ) and were transiently expressed in Cos? cells. 
724>our supcraaiants were tested for thdr ability to sustain cellular 

20 piDlifoa^on of the ^K>-dq)eadent HCD57 cell Une (Sawyer. S.T. et al., 
Blobd 72 (suppl. 1): 132a (1988)). 

ResuUs and iXseusslon 

Because of the known iHOlo^cal cross-rBactiviqr between various 
mammalian Epos and the presumpdon of strong homology in die coding 
25 sequences, a logical cloning strategy would be to screen cDNA libraiies widi 
moderate to low stringency. However, Epo mRNA is expressed at barely 
detectable levels in all tissues except in hypoxic kidney and liver (Goi^ig. 
M.A. a al., Proc. Natl. Acad. Sd. USA 84: 7972-7976 (1987); Fandrey, J. 
etal.. Blood 81: 617-623 (1993)). Therefore it would be necessary to 
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generate libraries for each of the species of interest. Our primary research 
interest is in structure-function relationships of Epo and therefore we require 
full coding sequences of the mature Epo protein, rather than full cDNA 
sequences. Accordingly, we elected to employ a PCR cloning strategy 

5 predicated on known strong homologies in Epo cDNA sequences that flank the 
region encoding the mature protein. There was sufficient conservation of 
sequence around the NCOl site in the 3' untranslated region (UTR) that a- 
single primer could be used to amplify full length mature Epo coding sequence 
in rat, sheep, pig and cat as well as dog and rabbit (datt not shown). 

10 However, we were unsuccessful in finding a universal 5' primer and tfierefore 
had to generate species specific sequences from amplifications of genomic 
DNA. This two stage PCR strategy proved to be satisfactory in generating 
highly accurate sequence information with a minimum expenditure of time and 
resources. In all instances, there was full agreemem between sequences of 

15 complementary strands. 

Preparation of Mammalian Epo cDNAs 

Primer Designfor PCR Ampmation of Epo. The sequences of Epo 
genes from tiiree species have been already published: two primates, human 
(Jacobs, K. et al.. Nature 313:m-m (1985); Un. F.-K. et a/., Proc. Natl. 

20 Acad. Sd. USA 52:7580-7584 (1985)) and Cynomolgus monkey {Macaca 
faxiailaris) (Un. F.-K. et al.. Gene ^:201-209 (1986)) and a rodent, the 
mouse (M«s mustadus) (McDonald. J.D. et al., Mol. & CeU Biol. 6:842-848 
(1986); Shoemaker. CB. et al., Mol. & Cell Biol. 6:849-858 (1986)). There 
is substatitial homology at ihe nucleotide level not only in the coding 

25 sequences, but also in portions of introns and in 5' and 3' untranslated 
sequences of cxons 1 and 5. respectively. A number of primers, 
cone^nding to conserved nucleotide sequences between mouse and-human, 
were synthesized and tested for their ability to produce gcnomic-PCR 
fragments from a wide variety of mammalian cell Hues. A 4 kb human 
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genomic clone and genomic DN A extracted from the human hepatoma cell line 

Hep3B were used as control templates for the PGR amplifications. 

In panicular, two pairs of primers. 1V1/EX2R and EX5/NC01 

(Figure 1), directed the amplifications of fragments of 330 and 250 bp 
5 (respectively) from genomic DNA of all human and mammalian cell lines that 

we investigated. I VI is a 25-mer oligonucleotide localized in intron 1, 

216/217 bases upstream from the start of cxon 2. Human and mouse are- 

identical except that the human sequence contains an extra G nucleotide at 

position 996. IVl primer corresponding to the mouse sequence was 
10 synthesized. The 23-mer EX2R ends 28 nucleotides upstream from the 3' end 

of exon 2. EX5 is a 22-mer, beginning 34 bases downstream from the 5' end 

of exon 5. NCOl represents a 20 bp, 100% conserved DNA fragment, 

starting 1 12/1 13 bases downstream from the TGA stop codon and contains the 

unique NCOl site present in the human and mouse gene. 
15 An ATG primer was also synthesized, corresponding to a 20 

oligonucleotide stretch, extending both 5' and 3' from the initiator methioiune 

codon. This sequence is also totally conserved between primates and mouse. 

While combinations of exonic primers (ATG1EX2R, EX5/NC01 and 

ATG/NGOl) were able to direct amplification from cDNA prepared from 
20 RNA of the human Epo-producing cell line Hep3B, no amplification was 

obtaiMd on cDNA prq)ared from uninduced or hypoxia-induced BHK, 

MDOK, LLG-PKl. MDCK, CRF-K, PALl-K and SPl-K cells (results not 

Aown). Theiefoie these renal cell lines apparently do not express Epo 

mRNA, cither constitutively or after hypoxic stress, at a level detectable even 
25 by RT-PCR. Furthermore, no Bpo protein was detectable by 

radioimmunoassay in the supematants of cells maintained overnight in 1 % O2, \ 

5% COa, 94% N2 at 37*G (Goldberg, M.A. etai, Proc. NatL Acad. Sd. 

USA 84:7972-7976 (1987)). 

The amplification of species specific IV1/EX2R and EX5/NC01 
30 genomic fi:agments allowed us to design a strategy shown in Figure 2 for 

obtaining cDNA clones contaiiung tiie complete coding sequence of mature 
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Epo protein from various mammals. The IV1/EX2R primer pair resulted in 
a genomic fragmem (-330 bp) corresponding to the 3' third of die first intron 
and 80% of exon 2, encoding the COOH-end of the propeptide and the NHj- 
terminal amino acids of the mature protein. EX5/NC01 primers amplified a 

5 segment of exon 5. containing die COOH-end of die Epo protein and a 3' 
untranslated sequence of about 140 bp downstream of die stop codon: DNA 
sequencing of these 5' and 3' fragments permitted, if necessary, the design of • 
a second generation of species-specific primers (SP). localized outside die 
coding pan of the mature protein (i.e.. in the sequences encoding the 

10 propeptide and the 3' untranslated region). 

Amplification of Partial Intron II Exon 2 Genomic Clones. We first 
explored die efficacy of die IVl and EX2R primers for amplification of 
genomic fragments from differem purified DNA templates. In all the 
reactions, fragments of predicted size were die main (and. when using the 

15 most stringem annealing temperature, often die only) products detected by 
agarose gel electrophoresis. The (direct) sequences of diese PCR-generated 
fragments ate presented in Figure 3. The nucleotide alignmem includes 6 
differem orders of mammals: one primate, human (Homo sapiens); two 
artiodactyls. sheep (Ovis aries) and pig (Sins scrofa); one perissodactyl, horse 

20 (lEquus caballus): one cetacean, spotted dolphin (Stendla plagiodon); diree 
carnivores, dog (Cams fandUaris), cat (FeUs cams) and lion (Panihem leo)\ 
two rodents, mouse (M«j musculus) and hamster (Cricetus cricetus). Cat and 
lion exMbited an almost complete sequence identity, widi only one T to G 
nudeoiide subsUtution near the 5' end of the amplified portion of the first 

25 intron. 

The ability of die intioiuc IVl to anneal to genomic DNA from various 
species and, in combination witii EX2R, to amplify PCR fragments of identical 
size, demonstrated a high degree of conservation of sequence and position 
between mammals. This suggests dial IVl sequence may be involved to some 
30 extent in flic legulation of flie Epo gene expression. The comparison of 
IV1/EX2R sequences also revealed twoi remarkably conserved intromc 
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sequences: AT(T/A)GAATGAA(G/C)GC (SEQ. ID NO: 1] (nucleotides 1046 
to 1058 in the human gene) and A(A/DGGTN(G/OGGG (SEQ. ID NO: 2] 
(nucleotides 1085 to 1094). 

AmpUfication of Partul Exon 5 Fragments from Genomic DNAs. 

5 PCR reactions were also performed applying the EX5/NCX)1 primer pair to 
various purified genomic DNAs. As previously observed for the amplification 
of IV1/EX2R. in all the tested samples a unique main band of about 250 bp 
(as predicted for human and mouse genes) was detected on analytical agarose 
gel electrophoresis. Direct DNA sequencing of these fragments demonstrated 

10 that the bulk of the purified PCR product corresponded to the expected Epo 
exon 5 sequence. However, for some DNA samples, purified from horse 
kidney and several cell lines (BHK and LLC-PKl in particular), unknown 
sequences were present to various extents. These minor contaminating PCR 
products generated extraneous peaks in the sequence data, resulting in 

15 ambiguities at several positions. Nevertheless, computer-edited analyses of 
generated 3' non-coding sequences were sufficient to design, if necessary, 
specific primers vath greater than 95% accuracy (data not shown). 

Cbning of Partial cDNAS Encoding the CompUU Mature Epo 
Protein. As we previously mentioned, we were unable to obtain any 

20 amplification with cDNAs prepared from renal-derived mammalian cell lines. 
Therefore, we tested our battery of primers on reverse-transcribed RNAs 
prepared from Iddney of several mammals. We have successfully amplified 
Epo cDNA from: one primate. Rhesus monkey iMacaca nudlata), one 
carnivore, cat (Fefii cams), one rodent, rat (Rattus norvegicus) and two 

25 artiodactyis. pig (Sitt5cnjfa) and sheep (Om ana). The aligned nucleotide 
sequences are presented in Figure 3. They have been submitted to GenBank 
and have been asagned accession numbers. 

Rhesus monkey, rat and sheep cDNAs were amplified using the 
ATC/NCOl primer-pair combinaiioB. Pig and cat seqtiences were obtained 
30 by the use of a 5' 24-iDer specific primer (SPl-5' 
TGCTrciGCTATCTITGCTGCTGC 3') [SEQ. ID NO: 3]. IV1/EX2R 
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sequencing demonstrated that this exon 2 sequence was shared by the two 
species. In all PCR amplifications, NCOl was used as the reverse primer. 
Conservation of the NCOl sequence among mammals suggests a potential 
biological role of this nucleotide sequence (perhaps in modulating the stability 

5 of Epo mRNA) (Rondon. L J. eiai, J. Biol. am. 266:16594-16598 
(1991)). Considerable homology is also found in the sequenced 3' 
untranslated segment of exon 5 (Figure 4). 

When compared with human Epo. the overall percentages of nucleotide 
identity are, respectively: 93.5 for the Rhesus monkey. 86.7 for the cat. 85.7 

10 for the pig, 83 for the sheep and 75.5 for the rat. Rhesus monkey and 
Cynomolgus monkey (Lin, F.-K. et ai, Gene 44:20U209 (1986)) were 99.6% 
identical. Mouse (McDonald, J-D. etcL, MoL & Cell Biol. 6:842-848 
(1986); Shoemaker. C.B. et c/., Mol. & Cell Biol 6:849-858 (1986)) and rat 
nucleotide sequences showed greater than 93% identity. The two sequenced 

15 artiodactyls, pig and sheep, showed 88% identity.* 

Comparison of Mammalian £/70 Primary Sequences 

Propeptides. The predicted amino acid prq)eptide sequences of several 
mammals arc presented in Figure 5. In the primates, there was strong 
conservation of the sequences of deduced propeptides, with only two amino 

20 add (aa) substitutions at portions -11 (Leu in humans vs. Val in both 
monkeys) and -2 (Leu in humans vs. Pro in botii monkeys). Expression of 
Cynomolgus monkey Epo gene in cultured mammalian cells resulted in tiie 
production of mature monkey Epo tiiat was elongated at tiie N-ternunus by 
tiuee additional rewdues: Val-Pro-Gly (Lin, F.-K. et c/.. Gene ^:201-209 

25 (1986)). As the Rhesus montey has die same substitutions, an identical site 



^ Several PCR attempts on total RNA purified from die horse's kidney 
were unsuccessful, even when using 5' and 3' ^ific equine primers (SP), 
presumably because analysis on 1 % formaldehyde agarose gel demonstrated 
die degradation of the RNA. 
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of cleavage is likely. The Leu lo Pro amino acid replacement ai -2 is 
probably responsible for the differential activity of the signal peptidase 
observed between man and monkeys. The three rodents, mouse, rat, and 
hamster, were found to have identical propeptide amino acid sequences. 

Mature protein. A comparison of the three previously published and 
our five deduced amino acid sequences of mature Epo proteins is shown in 
Figure 6. Erythropoietin is highly conserved among mammals. The amino 
acid alignment showed that more than 63% of the molecule is composed of 
invariant amino acids (106 residues) (Figure 6A). Most of the observed 
substitutions are conservative, involving residues with similar physical and 
chemical properties. The calculated percentages of sequence identity aie 
shown in Table II below. 



15 



20 



TABLE n 
Degree of conservation among mammals 
of the mature Epo protdns 


The percentages of identity among die various sequences were 
determined from the amino acid alignmrat reported in Figure 6. 




% identity 


Primates 


Man/Rhesus Monkey 


90.5 




Rhesus/Cynomolgos 


98.8 


Rodents . 


man/mouse 


79.8 




man/rat 


82.1 




mouse/rat 


93.5 


Artk>dactyis 


man/sheep 


81.0 




man^ig 


81.6 




sheq>^ig 


88.7 


Carnivores 


man/cat 


84.5 



There is 100% conservation of Cys7 at the N-ierminal portion of the 
polypeptide and Cysl61 near the Gtenninus. This disulfide bridge is essential 
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to the formation of a stable and functional cytokine (Wang, F.F. et aL, 
Endocrinology ll&mt-llSfl (1985)). In contrast, while Cys29 is invariant 
in all the species we examined. Cys33 is present only in primates, sheep, pig 
and cat, but not in the two rodents where there is a proline. The lack of 

5 functional importance of this short disulfide loop is underscored by muteins in 
which tyrosine substitutions at each or both of dicse sites have no effett on 
Epo's biological activity (See Example II). 

There is also 1(X)% conservadon of all three asparaginc residues 
responsible for N-linked glycosylation. Even though Epo. deprived of its 

10 carbohydrate either by enzymatic cleavage (Takeuudii, M. «r d., J, Biol. 
Chem. 265:12127-12130 (1990)) or by production in bacteria (Nahri, L.O. 
etal., J. Biol. Chem. 266:23022-23026 (1991)) has full in vitro biological 
activity, survival of the hormone in the circulation depends upon N-linked 
glycosylation (Spivack. J.L. et al.. Blood 73:90-99 (1986)). In contrast, the 

15 0-Unkcd glycosylation site (Serl26 in human B^) is not essential for 
biological function since it is nismg in mouse and rat Epos. 

As shown in Figure 6, there is very high conservation of sequence in 
regions which, by algoridims based on primary strucnire, are predicted to be 
alpha helices (See Example II). The few differences in sequence within these 

20 helical regions are conservative rq)lacements. Indeed, some sites wtWndicse 
pie4icted alpha helices are conserved is coneqwnding h^oes of ofter 
cytokines (Manavalan, P. etal., J. Pmt. Chem. 77:321-331 (1992)). In 
coiitrast, regions predicted to be inter-helical loops are less well conserved, 
particularly die 14 readue strctdi between residues 116 and 130 of human 

25 Epo. In Example II a mutein widi a deletion of readues 1 1 1 through 1 19 is 
shown to have normal stability and biological activity, whereas a deletion of 
readues 122 through 126 fails to produce a detectable protein, probably owing 
to naricedly impaired stabiUty. 

Similar overall 4 a-helical bundle structure ensts or is predicted fat 

30 several otfier cytokines/hormones, such as growdi hormone (GH), granuloqrte- 
maciophage colony-stimulating factor (GM-CSF). IL-3, IL4 and IL-S (Bazan, 
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J.F. et al, Immunol Today 11: 350-354 (1990)). Like Epo, GH exhibits a 
high degree of primary sequence conservation and biological crossrcactivity 
between mammals. The mouse, rat, pig and sheep proteins show about 80% 
amino acid identity with human GH. In other cytokines, however, there is 

5 more sequence diversity among species. For example, the human and murine 
forms of IL-5, GM-CSF and IL-3 show respectively only 69, 67 and 26% 
amino acid homology. Furthermore, unlike Epo or GH, those proteins lack 
inter-species biological cross-reactivity. 

Expression of the MammaUan Epo Hormones. Mammalian ^ 

10 cDNAs were subcloned into the pSG5 plasmid and transiently expressed in the 
monkey Cos? cell line. Supematanis of the transfected cells were able to 
sustain the cellular proliferation of the murine Epo-dependeni HCD57 cell line, 
demonstrating the biological cross-reactivity between species. As the human 
Epo antisera used in our radioimmunoassay bind to Epos from other species 

15 with variable affinity, the amount of produced protein was difficult to 
determine accurately. Experiments are in progress to further characterize Ae 
relative affinities of the various mammalian erythrq)oietins toward murine and 
human Epo receptors. 

Phylogenetic Analyses of Epo Sequences. Nucleotide sequences 

20 encoding the full length mature Epo protein were analyzed by the maximum 
parsimony method (Czelusniak, J. et al.,MetlL Enzyntol J«3:601-615 (1990); 
Stanhope. M J. et d., Mol Phybg. Evol i:148-160 (1992)). An 'all trees' 
set of conq>uter programs determined the parsimony lengA of each of the 945 
unrooted trees formed by the seven cDNA sequences (the terminal taxa) 

25 (Figure 4) and, on ordering diose 945 trees accordii^ to increasing length, 
identified the minimum number of extra nucleodde substitutions needed to 
break up each group in the maximum parsimony (lowest leqgth) tree. As 
shown in Figure 7 and 7A, the mouse and rat are strongly grouped, as are Ae 
human and monkey. The pig and sheqi are more weakly groiq)ed, in accord 

30 widi accepted phylogenedc views, which place the divergence of diese animals 
about 55-60 million years ago. As expected wiA the parsimony reconstruction 
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10 



on only the codon sequences, silent base substitutions occurred much more 
readily than amino acid replacements (Figure 7B). 

The Epo intron 1-cxon 2 sequences shown in Figure 3 were also 
analyzed by the maximum parsimony method. Figure 8 shows strength of 
grouping results based on all 135.135 unrooted trees formed by nine terminal 
uxa (in this case, cat and lion sequences, which are nearly identical, were 
treated as a single taxon). and Figure 8A shows a phylogenetic tree that- 
combines maximum parsimony with establishedphylogenetic evidence. As the 

most parsimonious tree (Figure 7) for these relatively short sequences failed 
to join dog to the feloid (cat and lion) branch (reflecting d>e fact that the 
ancestral split between canoids and feloids traces back to the early carnivores 
at about 60 million years ago), the phylogenetic tree shown in Figure 7A is a 
near most parsimonious tree in which dog joins the feloids. The rodents 
(mouse and hamster) are strongly grouped, as are the feloids (cat and lion), 
15 and at more moderate strength the cetacean (dolphin) is grouped with the 
artiodactyls (sheep and pig). n»ere are other molecular data as well as some 
paleontological data that depict cetaceans originating from early artiodactyls 
(Crelusniak. J. aal. Meth. EnzymoL i«:601-6l5 (1990); Czelusniak, J. 
.aal.. In -Current Mammology". Volume 2, 3rd ed.. H. H. Ganoways. 
20 pp. 545-572, Plenum Publ. Corp, (1990); Irwin. D.M. etal.. J. Mol. 
EvoL 32:128-144 (1991); Gingreicb. ?.D.et al., Science 249:154-151(1990): 
Novacck, M.J.. Nature 556:121-125 (1992)). 
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EXAMPLE II 

Stntclure Function Relationships of Erythropoietin: 
Muteins that Test a Model of Terdary Structure 

Summary 

5 On the basis of its primary sequence and the location of its disulfide 

bonds, we propose a structural model of the erythn^oietic hormone 
erythropoietin (^) which predicts a 4 a-helical bundle motif, in common 
with other cytokines. In order to test this model, site directed mutants were 
prepared by high level transient expression in Cos7 cells and analyzed by a 

10 radio-immune assay and by bioassays utilizing mouse and human Epo 
dependent cell lines. Deletions of S to 8 residues within predicted o helices 
resulted in the failure of export of the mutant protein from the cell. In 
contrast, deletions at the N-ternunus (Al-6), the C-tcrminus (A163-166), or in 
predicted interhelical loops (AB: A32.36, A53-57; BC: A78.82; CD: 

IS A11M19) resulted in die export of immunologically detectable ^ muteins 
tiiat weie biologically active. Tht mutein A48-52 could be readily detected 
. by RIA but had markedly decreased biological activity. However, 
rq)lacement of each of tiiese deleted residues by serine resulted in 1^ 
mutpins witii fidl biological activi^. Replacement of Cys29 and Cys33 by 

20 tyiosii^ leddues also resulted in the export of fully active Bpo. Therefore this 
small disulfide loop is not critical to stabili^ or function. The 
pitiperties of the muteins that we tested are consistent witii our proposed 
model of tertiary structure. 
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Introduction 

Humoral regulation of red blood cell production was first proposed at 
the beginning of this century (Camot. P.. etal.. CR. Seances Acad. 
Sci. 745:432-35 (1906)). Convincing physiologic experiments documenting 
5 the existence of erythropoietin (Epo) (Krumdieck. N., Proc. Soc. Exp. Biol. 
Med 54:14-17 (1943); Reissmann, K.R., B/oorf 5:372-3 80 (1950); Erslev, A., " 
B/ood 5:349-357 (1953); Jacobson, LO.. et ai. Nature 779:633-634 (1957)) 
were followed by its purification (Miyakc, T.. etal.. J. Biol, 
am. 252:5558-5564 (1977)) and partial structural characterization (Lai, P.- 
10 H.. et al.. J. Biol. Chem. 267:31 16-3121 (1986)). The molecular cloning of 
this biologically and clinically important cytokine (Jacobs. K.. etal.. 
Nature 313:SQ^m (1985); Lin. F.-K.. etal., Proc. Nat. Acad. Sd. 
USA 82:7580-7585 (1985)) has led to further understanding of its properties 
(Sasaki, H.. etal.. J. Biol. Chem. 262:12059-12076 (1987); Davis. J.M.. 
15 et al.. Biodiemistry 26:2633-2638 (1987)). 

The binding of Epo to its cognate receptor (D* Andrea, A.D.. tf al.. 
CeU 57:277-285 (1989)) on erythroid progenitors in the bone marrow results 
in salvaging these cells from apopiosis (Kouiy. M.J., etal.. 
Science 248:378-381 (1990)). allowing them lo proliferate and differentiate 
20 into circulating erythiocytes. The Epo receptor is a member of an ever 
enl.arging femily of cytokine receptors (Baran, F,. Biodum. Bhpkys. Res. 
Cornmun. 764:788-796 (1989)). In like mamier. ^ shares weak sequence 
homology with other members of a femUy of cytokines which also include 
growdi hormone, prolactin, IL.2, IL-S, IL4. 11.5, IL^. IL-?, GCSF, OM- 
25 CSF, M-CSF, oncostatin M. leukemia inhibitoty factor and cUiaiy 
- neurotrophic factor (Parry, D.A.D., et al.. J. Mol. Recognition 7:107-110 
(1988); Bazan, F.. Immunobgy Today 77:350-354 (1990); Mareivalan, P., 
etal.. J. Protein Chem. 77:321-331 (1992)). The genes encoding these 
proteins have similar numbers of exons as weU as a clear relationship between 
30 intron-cxon boundaries and predicted o-heUcal structure. These similarities 
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have led to the prediction that this family of cytokines share a common pattern 
of folding into a compact globular structure consisting of four amphipathic 
a-helical bundles. Such theoretical models of the structures of human growtii 
hormone (Cohen, F.E.. etai. Protdns: Struct. Func. Genet. 2:162-166 

5 (1987)) and 1^4 (Curtis. B.M., etd., Protdns: Strm. Func. 
Genet. Ji:lll-119 (1991)) have been in remarkably good agreement with 
subsequent structures established by X-ray diffraction (HGH) (Abdel-Meguid. • 
S.S., a al. Proc. Natl. Acad. Sd. USA 54:6434-6437 (1987); deVos. A.M.. 
et al. Sdence 255:306-312 (1992)) or by multidimensional NMR (IL4) 

10 (Redfield. C.,et al. Biochemistry 30:\ 1029-11035 (1991); Powers. R., et al.. 
Sdence 256:1673-1677 (1992)). Moreover, die crystal structures of GM-CSF 
(Diedcrichs, K., etal, Sdence 254:Vll9-\m (1991)) and monomeric M- 
CSF (Pandit, J., et al, Sdence 258:1358-1362 (1992)) are also in reasonable 
agreement witii their predicted structures. 

15 Thus for, the structure of Epo has not been analyzed by either X-ray 

diffiaction or by NMR. In order to begin to gain an understanding of 
structuit-function relationships, we have taken a three-pronged approach: 

a) Sequence determination of ^o from mammals of different 
orders in Older to establish regions of homology. This work is described in 

20 Example I. 

b) Construction of a model of the three-dimensional structure of 
fdlowcd by Ac design and prqniation of mufdns diat test dus model. 

These experiments are presented in this Exanqile. 

c) Deagn and testing of muteins diat pnmde information on 
25 receptor binding donjain(s). Tins woric is picsented in Example HI. 
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Materials and Methods 



Compuur-based Modeling of Structure 

Prediction of Secondary Structure. Epo sequences from human, 
monkey, mouse, rat, sheep, pig, and cat were aligned (See Example I), and 
5 examined using a hierarchical approach to secondary structure piedicUon that 
assumes that these proteins are members of the ala folding class (Uvitt. M. 
a al.. Nmre26h5S2-S5i (1976)). First, the pattern-based method of Cohen, 
F.E. et al.. Biochemistry 25:266-275 (1986) for mm prediction was used to 
delimitsequenceblocks likely tocontainsccondary structure. Predictions using 

10 d,e methods of Gamier. J. « cL. J. Mol Biol. i2ft97-120 (1978) and aou. 
P,Y. ad.. Ann. Rev. Biochm. 47:251-216 (1978) suggested a-helical 
regions within these blocks. Fmally, heUcal wheel projections were used to 
examine and then limit helix length based on pieserving amphipafliic character 
as codified in the work of Presnell. S.R. et al.. Biochendstry 37:983-993 

15 (1992). The locations of glycosylation sites were also used to suggest helix 
boundaries. 

Tertiary Structure Prediction. Eariicr investigations have revealed the 
general principles of helix-to-helix packing in globular proteins (Richmond. 
TJ'., etal.. J. Mol. Biol. lJ9-33%55 (1978)). Exploring these principles. 

20 Coh«i, F.E. et al.. J. Mol. Biol. 732:275-288 (1979) developed a metbod for 
the generation of three-dimensional protein strucwres from the secondary 
strucnire assignment "niese metiwds have been applied to myoglobin, tobacco 
mosaic virus coatprotein. growtit hormone, o- and /J-interferon. lL-2.and IL4 
(Cohen. F.E.. a al.. J. Mol. Biol. 732:275-288 (1979); Cohen. F.E.. et al.. 

25 Science 23*349-352 (1986); Cohen, F.E.. etal.. Proteins: Struc. Func. 
Gen. 2:162-166 (1987); Sternberg. MJ.E.. etal.. Int. J. Biological 
Maamokades *137-144 (1982)). 
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The algorithm for ternary structure generation is divided into four 
computations. Theprogram aapatdi identifies clusters of hydrophobic residues 
within the putative helices that could mediate helix-helix interactions 
(Richmond. T.J., et ai.. J. MoL Biol. 779:537-55 (1978)). Aafold generates 

5 all possible helix pairings according to the location and geometric preferences 
of the interaction sites. Aabuild generates the three-dimensional models of all 
possible structures from the list of helix pairings (from aafold) and subject to 
stearic restrictions and geometric constraints on chan folding. In the final 
step, aavector ^plies the user defined distance constraints (e.g., disulfide 

10 bridges) to the structures generated. At this stage, coordinates have been 
specified only for residues in the core a-helices. For residues in sequentially 
distinct loops, lower bounds on the inter-residue distances can be inferred from 
the relevant helix terminus. 

FnpanlUtn ofEpo Mutans 

1$. Constmction of the Mutdgenic/Mammalian Expression Plasmid. A 

M13 plasmid, contamng a 1.4 Kb EcoRl-EcoRI human ^ cDNA insert 
QiHEfo FL12) was a gift ftwn <3enetics Institute (Cambridge, MA) (Jacobs, 
K., etal., Nature SlSitOMlO (1985)). A 943 bp EcoRI-BglH fragment. 
ooiresiX>nding to Ibe cotofktt oodng sequence of die wild Qipe human 

20 eiythropcMedn, indwffing untranslated unions 216 bp iq)stream and 183 bp 
downstream, was inserted into the mammalian expresam plasmid pSG5 
(Stratagene) (Sambrook, J., etal.. In "Molecular Cloning: A Laboratory 
Manual", 2nd cd.. Cold Spring Harbor Laboratory, Cold Spring Haibor, NY, 
pp. 1.38-1.41 (1989)) and named pSGS-EpoWT. 

25 Site-Directed Mutagene^ was caniedomaooordlng to the protocol 

described by Knnkel, TJL, etoL, Meth. EazymoL ISi-^SI-m (1987). 
Single-stranded DNA was rescued from the pSG5-Epo/WT phagemid grown 
overnight in E. coU CJ236, in 2XYT media containing M13K07 helper phage 
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(In Vitrogen) and 70 /xg/ml kanamacyn (Sigma). The resulting uracil- 
coniaining ssDNA was used as a template for mutagenesis. Oligonucleotides 
(24 to 46 mcr) were synthesized with their 5' and 3' ends complementary to 
the target wild type Epo sequence. A large variety of mutations (base 

5 substitutions, deletions and insertions) were created at the centers of the 
mutagenic primer sequences. Annealing of the phosphorylated primers (10:1 
oligonuclcotide/DNA template molecular ratio) was performed in 10 /il of a 
20 mM Tris-HCl pH 7.4, 2 mM MgClj, 50 mM NaCl solution. The reactions 
were incubated at 80*C for 5 minutes, then allowed to cool slowly to room 

10 temperature over a one hour period. The DNA polymerization was initiated 
by the addition as a mix of 1 fil of lOX synthesis buffer (100 mM Tris-HCL 
pH 7.4, 50 mM MgClj, 10 mM ATP, 5 mM each dNTPs, 20 mM DTT), 0.5 
III (8 units) of T4 DNA ligase and 1 ^1 (1 unit) of T4 DNA polymerase 
(Boehringcr Mannheim). After 2 hours at 37'C, 80 id of IX TE was added. 

IS 5 III of the diluted reaction mix was used to transform competent E. cqU 
NM522 (ung*, dut+). 

Since a 40-80% mutation yield is normally obtained, 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced with 7-deaza- 
dGTP and Sequenase (U.S. Biochemical) (Sanger, F., it aL, Proc. Natl. 

20 Acad. Sd. USA 7*5463-5467 (1977)). As a rule, the entire coding sequences 
of the Epo mutants were examined for the presence of unwanted mutation by 
sequendng or restriction enzyme mapping. 

Ptvduction of WId Type Epo and Epo Muteins in Mammatian Cells, 
Cos! cells grown to approx. 70% confluency were transfectcd witii 10 /ig 

25 recombinant plasmid DNA per 10 cm dish using the calcium pho^hate 
precipitation protocol (Kingston, R.E., et al., In "Current Protocols in 
Molecular Biology", Green Publishing Associates & Wley Interscience, NY, 
pp. 9J.1-9.L9 (1991)). As a control of transfection efficiency, in several 
experiments 2 ng of pCHl 10 plasmid (Pharmacia) was co-transfected and fi- 

30 galactosidase activity measured in the cytq)lasmic extracts. 
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RNA Blot-Hybridization Analysis. Total RNAs were prepared from 
cultured Cos7 cells (Chirgwin, J.M., etal.. Biochemistry /S:5294.5299 
(1979)) and 2 iig samples electrophoresed on a 1.196 agarose gel containing 
2.2 M formaldehyde. Transfer to GeneScreen Plus filters (New England 

5 Nuclear) and hybridization with a '^P-labcled wi Epo probe were carried out 
as previously described (Faquin, etai. Blood 79:1987.1994 (1992)). 

Quantitation of Transiently-Expressed Recombinant Epos. The 
amount of secreted protein in the supematants of transfected Cos? was 
determined by a radioimmune assay (RIA), The RIA was performed using a 

10 high titer rabbit polyclonal antiserum raised against the human wild type Epo 
and produced in our laboratory. '^Habeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C., et aL Exp. Hematol. 21: 420426 (1993)), 

Immunoprecipitation of ^^S-labeled Epo Proteins. Three days after 

15 transfecuon, the Cos? monolayers were washed extensively with IX PBS and 
the cells incubated for 20 minutes at 3?**^ in 2 ml of Mer/Cys" Miiumum 
Essential Media Eagle modified. In each culture dish, 100 p\ of TRAN35S- 
LABEL (pS]-cysteinc and [^5].methionine, 10 mCi/ml, ICN Biochemical) 
was then added. After two hours, the conditioned media were harvested and 

20 cellular extracts were prepared by lysis in radioimmune precipitation buffer 
(50mM Tris-HCI pH 8.0, 150 mM NaQ, 0.02% (w/v) sodium azide, 0.196 
(w/v) SDS, 0.5% (w/v) sodium deoxycholate, 1 % (w/v) Triton X-100, 1 mM 
phcnylmetiianesulfonyl fluoride and 1 p^m\ aprotinin). Samples were 
precleaied with rabbit preimmune serum/protein A-Sq>harose CL-4B 

25 (Pharmacia) for two hours. Immunq)reciphations were performed overnight 
wiA our polyclonal antibody ^ific for human recombinant wild type Epo 
and immunoadsorbed with protein A-Sq>harose CL4B. Immunoprecipitates 
were run on 15% SDS-polyacrylamide gels (Laemmli, U.K., Nature 
227:680-685 (19?0)) and analyzed by autoradiography after treatment widi 

30 Enhance (New England Nuclear). 
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Bioassays. The dose-dependent proliferation activities of wild type Epo 
and Epo muteins were assayed in vitro using three different target cells: 
murine spleen cells, following a modification of the method of Krystal 
(Krystal, G., Exp. HematoL 77:649-60 (1983); Goldberg, M.A.. et at., Proc. 
5 Natl. Acad. Sci. USA «:7972-7976 (1987)); Epo responsive murine 
eiythroleukemia cell line, developed by Hankins (Hankins, W.D., etal.. 
Blood 70:173a (1987)); and human Epo-dependent UT-7/EpoceIl line, derived • 
from the bone marrow of a patient with acute megakaryoblastic leukemia 
(Komatsu, N.. et aL. Cancer Res. 57:341-348 (1991)). After 22 to 72 hours 
10 of incubation with increasing amounts of recombinant proteins, cellular growth 
was determined by pH]-thymidine (New England Nuclear) uptake or by the 
colorimetric MTT assay (Sigma) (Yoshimura, A., etal., Proc. Natl. Acad. 
Sd. USA S7:4139-4143 (1990)). 

Bacterial Expression. The wild type ^o target, corresponding to the 
IS nucleotide sequence coding for the mature protein, was PCR-amplified using 
appropriate primers* In the sense primer an Ndel site (CATATG) was placed 
inmiediately 5' to the Alal codon of the mature protein. In the antisense 
primer a Bglll site was placed 3' to the TGA stop codon. After enzymatic 
digestion, the 516 bp PGR fragment was inserted in an Ndel/BamHI-cut 
20 pET16b plasmid (Novagen), which has a T7 promoter followed immediately 
by the lac operator. IPTG induction of transformed E. coU BL21 (DE3) (17 
RNA polymerase^. Ion', ompT) resulted In high levels of cxpresaon of a 
fu^onpiotem with a ten histldinestretdi at die amim> tennis Tbeoligo- 
His tag allowisd the binding of the produced (HiSgo)-^ on a nickel affinity 
25 resin and its elution by increasing imidazole concentrations in presence of 
phenylmethylsulfonyl fluoride (Sigma). Most of the produced protein formed 
insoluble aggregates and was solubilized and affinity-purified under denaturing 
conditions in 6 M guamdine*HCU Oxidative refoldiqg was performed by 
overnight dialysis at 4T against SO mM Tris-HCI pH 8.0, 40 ftM C11SO4 and 
30 2% (wt/vol) Saifcosyl. Soluble protein was fimher dialyzed against 20 mM 
Tris-HQ pH 8.0, 100 mM NaQ, 2 mM QCi. and subjected to factor Xa 
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(New England Biolabs) cleavage to remove the NHj-polyHis sequence. 
Monitoring of the fusion protein following induction and during the various 
steps of purification was done by elecu^ophoresis on a 15% polyacrylamide- 
SDS gel, stained with Coomassie Brilliant Blue. Alternatively, the His-Epo 
5 fusion protein was detected on Western Biol (Gershoni, J.M., et al, Anal 
Biochem. 757: M5 (1983)), using a 1/2000 dilution of our wt native Epo 
polyclonal antibody and a second biotinylated rabbit-specific antibody which 
is detected with a streptavidin-alkaline phosphatase conjugate (Amersham). 
In Vitrv Transcription/Translation. Sense and antisense primers, 

10 creating new Bglll sites respectively S' and 3' of the initiator and stop codons, 
were used in a polymerase chain reaction on pSGS-EpoAVT template. After 
Bglll cleavage, die 594 bp PCR-fragment was subcloned into p SP64T (Krieg. 
P.A., etal., NucL Acid. Res, 72:7057-7070 (1984)). This SP6 containing 
vector provides 5' and 3' flanking regions from Xenopus ^-globin mRNA, 

15 which allow efficient in vUro tFanscription/translation. Previous experiments 
showed poor yields of in vitro translated protein, when using the GC-rich 
natural S' 1^ untranslated region (UTR). One step in yntro transcription/ 
translation was carried out by incubation of 1 /ig of circular p64T-Epo in a SO 
reaction volume of SP6-TnT coupled rabbit reticulocyte lysate system 

20 (Promega), in presence of [^5]-cysteine (1200 Ci/mM, New England 
Nuclear). In some cases, canine pancreatic microsomal membranes wane 
added to die reaction mix. A purified GST-human Epo reoq>tor extracellular 
domain fufion protein (EREx) was a gift fiom W. Harris and J. Winkelman, 
and the binding of the ^S*labeled translation products onto EREx-glutatiuone 

25 agarose beads were performed as described (Harris, K.W., et al., /. KoL 
Chm. 267:15205-15209 (1992)). 
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Results 

Construction of a Model of the Three-Dimensional Structure of 
Erythropoietin 

From an analysis of the putative Epo helix sequences, aapatch 

5 identified eight possible helix-helix interaction sites. In principle, these sites 
could be used to generate h6 x 10* structures. Of these, only 706 maintained 
the connectivity of the chain and were sterically sensible. These structures 
resembled four helix bundles, an increasingly common motif in protein 
siniciure (Presnell, S.R., et aL, Proc. NatL Acad. ScL 86:6592-6596 (1989)). 

10 The sirucnires that were not compatible with the native disulfide bridge 
between Cys, and Cyswi were eliminated. This reduced the total number of 
structures from 706 to 184 (total computer time approximately 1 hr. on a 
Silicon Graphics IRIS 4D/35G). The remaining structures were then rank 
otdeied by solvem*accessibIe sur&ce contact area, a measure of the validiQr 

15 of model structures. The most compact structures were right handed, all anti- 
paiallel four-helix bundles with no overhand connecdons, but Ais may be an 
artito of a fidluie to add the polypeptide chain that forms the loq>s to die 
helical core constructed by aabuild. The odier less compact structures were 
lef^-handed four-heluc bundles widi two overhand kx^, a topolqgy pre^ously 

20 seen in die structures of IL4 and growdi hormone. We suspect diatdiis is die 
lilcdy structure for Epo, The consensus for assignments of putative o-helioes 
in human Epo are summarized in Table III. First, analysis of die topological 
distribution of known four-helix bundle structures indicate diat nearly all 
examples have an antipaiallel orientation (Presnell, S.R., et aL, Proc. NatL 

25 Acad. Sd. 56:6592-6596 (1989)). Second, die left-handed four-helix bundles 
wiA two overtiand connections arranges the four amphipaduc helices to form 
a compact hydrophobic core. Finally, a path for die loop regions of Epo can 
be imagined by analogy to IL4 and growth hormone diat preserves the 
compact globular nature of die Epo model structure. Figure 9 shows 
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schemauc representations of the possible topological interactions between the 



four-anti-parallel a-heiical bundles. 







Table III 




Predicted 0«iielicai regions of Uie mature erythropoietin protein 


Helix 


N-tenninus 


C-terminus 


Potential Helix-Helix 
. Interaction Sites 


A 


9 


22 


19 


6 


59 


76 


63. 67. 70. 71 


C 


90 


107 


95. 102 


D 


132 


152 


141 


Dau were obtained using the various algorithms for secondary and 
tertiary structure generations described under Materials and Methods. 



Several authors have suggested that the helical cytokines form a 
structural superfamily (Cohen, F.E., etd., Science 23^:349-352 (1986); 
Parry. D.A.D., eiai, J. Mol Recognition 7:107-110 (1988); Bazan, 

15 Biodiem, Biophys. Res. Comm. 164:lSi-195 (1989); Cosman, D„ et cl., 
Trends Biodiem. Sd. JJ:265-270 (1990); Bazan, J.F., Proc. Natl. Acad. Sd. 

57:6934-6938 (1990)). On the basis of both the mature protein and die 
individual a-belioes, Epo seems to be more closely related to growth hormone, 
prolactin, ILr6 and GM-CSF rather than the other members of the helical 

20 cytokine superfomily. Nevertheless, recent improvements in algorithms for 
the identification of distant evolutionary relationships between proteins ftom 
structuial fii^eiprims suggested that it might be possible to align the IL-4 
structure to the 1^ sequences. The Gseiibers (Bowie, LU., et d.. 
Science 253:16^-170 (1991)) structural environment and 3D-1D profile 

25 methods are a powerful tool for recognizing diat a sequence is compatible mth 
a known structure, e-g., a four-helix bundle. The NMR structure of IL4 
from Smidi, LJ., etal., J. Mol. Biol. 22^:899-904 (1992) was used to 
construct a 3D-1D profile. A mixture of sequences including four bdix 
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bundles, globins, and non-helical sinjctures were aligned against the 1L4 
profile. Not surprisingly, the IL-4 structures from human and mouse gave the 
highest scores {V = 22.8 and 8.1). However, the other known four-helix 
bundle cytokines known to share a similar fold with IL4, e.g., human growth 

5 hormone (Abdel-Meguid, S.S., et aL, Proc. Natl. Acad. Sd. 8^:6434-6437 
(1987)) (Z = 2.3) and GM-CSF (Diederichs. K., etcL, Science 
254:1779-1782 (1991)) (Z = 2.3) fared no better than some globin sequences 
(Kuroda's and slug sea hare globin. Z = 5.0 and 4.8) that adopt a distinct 
tertiary structure. The results for the human and sheep Epo sequences were 

10 also ambiguous (Z = 1.6 and 0.8). These results suggest that while profile 
methods are a powerful tool for recognizing structural similarity, their failure 
10 identify homology does not exclude the possibility that two proteins share 
a common fold. For distantly related or unrelated structures, current profile 
methods cannot replace de novo methods for tertiary structure prediction. 

15 Design and Expression of Epo Muteins thai Test the Proposed Structure 

To test the proposed four a-helical bundle structure of erythropoietin 
and at the same time to attempt to locate functional domains, we created by 
site-directed mutagenesis a series of deletion, insertion and replacement 
mutants. These muteins were designed to analyze the princq)al predicted 
20 structural features of the molecule: a-helices, interconnecting loq)S, as well 
as tiie NH, and COOH termini. Structutal and functional implications of tiie 
disulfide bridges and the glyoosylation sites were also investigated. 

o Helices. Short amino acid deletions were prq>aied in, or close to, 
the predicted A, B, C and D o helices. Human wild type and muteins were 



25 'Z-scores ate used to describe tiie normalized weight associated with a 
profile score. A distribution is built from a collection of sequences widi a 
mean2scoreof 0.0 and a standard deviation of 1.0. Z-scores greater than 
6.0 ate associated witii significant alignments. Z-scores between 3.0 and 6.0 
may or may not be smictuFally relevant. 
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transiently expressed in Cos? cells. Northern blot analyses demonstrated that 
all the mutant plasmids produced about the same amoum of mRNA as that of 
the wild ^e (data not shown). Yet, no detectable amount of Epo protein 
could be found in the Cos? supematants, either by radioimmunoassay or by 
bioassay using various ^>o-dependent cell lines. Table IV summarizes these 
findings. 



mutants. 
A 12-16 

A 65-69 



A 96-100 
A 105-109 



A 122-126 

A 131-135 
A 140-144 
A 142-150 
A 152-155 

A 156-160 



9 
I 

22 



59 
i 

76 



90 
107 



132 
1 

152 



TABLE IV 
Short deletions in, or close to, a helices 



A HELK 



B HEUX 



C HELIX 



D HELIX 



RNA levels comparable to WT. 

no detectable Epo in tbe Cos? 
supernatant, bob by RIA and 
bioassay. 



Predicted N and C termiiu of eadi a lieUx are indicated in tbe vertkal boxes.. 

An example of SDS-PAGE of immunoprecipitants fmm in ^nvo "S- 
labeling is presented In Figure 10. As expected, when Cos? cells were 
transfected with pSG5-^)o/WT, a 3S-3? KD band was detected in die 
supernatant In contrast, the deletion mutants (Table IV) could be detected in 
5 cellular atracts but were not eqwited from the cells. Figure 10 shows the 
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cytoplasmic retention of the mutein A140.144, lacking four residues in the 
middle of the predicted D helix. The apparent molecular weight (around 28 
kD) is less than expected for a five-amino acid deletion. Therefore not only 
the secretion, but also the glycosylation, seem to be impaired. None of these 

5 muteins had deletion of glycosylation sites. It is likely that full glycosylation 
of Epo requires conservation of its molecular architecture. Similar results 
(reported in Table IV) were obtained for all the muteins having partial deletion- 
of an a helical peptide segment. 

Because contaminants in crude Cos? cellular extracts severely interfere 

10 with die radioimmunoassay, no direct Epo quantitation was possible. . 
However, aliquots of hypotonic extracts of Cos? transfected with wild type 
Epo were able to sustain HDC5? proliferation. No similar biological activity 
was found for muteins with limited deletion of a helices. 

Interconnecting Loops. The peptide segment joining A and B helices 

15 presents several interesting features (Figure 11). AB loop consists of 36 
amino acids. Two N-glycosylation sites and a small disulfide bridge are 
located in the first half and tiieir biological implications will be discussed later. 
The C(X)H end of the AB loop contains a stretch of amino acids that is 
strongly conserved among mammals (See Example I). Alignments of bumaiu 

20 monkeys, cat. mouse, rat. pig. and sheq) Epos showed a consensus sequence: 
DTKVNFYAWKR(M/D(E/D)VG (residues 43 to 57) [SEQ, ID NO: 4]. 
Three deletions were constructed: A43-47. A48-52, and A53-57. and 
tranaendy expressed in Cos7. The amount of mutdns detected by RIA in the 
supematants of transfected cells was 10 to 40% lower dian observed widi wild 

25 type Epo (Figure 11 A). Nevertheless, the tiucc secreted muteins were 
biologically active. However, because A48-52 exhibited a marked decrease 
of the specific bioacti^ty, this site was studied in more detml by means of 
serine replacements. Krystal a vzvo bioassay as well as HCD57 and 
ljT-7/^ in vitro bioassays showed that these Ser mutants had biological 

30 activities similar to that of vwld typt (Figure IIB), Therefore, the observed 
decreases in botii RIA and bioassay for the diree deletion mutants are likely 
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to be the result of changes of structural conformation. The long length of loop 
AB may be critical for the up-up-down-ddwn topogrs^hy, A shorter AB 
segment may impose a strain on the interhelical connection. Chou and 
Fasman algorithms (Chou, P.Y.. et aL, Ann. Rev. Biochm, 47:251-216 

5 (1978)) predicted a short j3-sheet structure from residues 44 to 51 (<Pa>; 
<Pb>. 1,005 < 1.196). The presence of a short region of i5-sheet in the 
conneciion between helices 1 and 2 (A and B) have been documented in the- 
analyses of the three-dimensional structures of IL-4 (Redfield, C, et ai, 
Biochemistry 30:n029-\m5 (1991); Powers, R., etaL, Science 256:1673- 

10 1677 (1992)). GM-CSF (Diederichs, K., ei ai. Science 254:1779-1782 

(1991) ), and monomeric M-CSF (Pandit. J„ et aL, Science 255:1358-1362 

(1992) ). In contrast, in human GH a short segment of a helix is found at the 
same location (Abdel-Meguid, S.S., et al.. Proc, Nail Acad. Sci, USA 
84:6434-6437 (1987)). The structure/function implications of these short 

IS features are not yet understood. 

Helix B is linked to helix C by a much shorter segment (residues 77 
to 89) and contains in its center the third N-glycosylation site (Asn83). When 
the A78-82 mutein was expressed, a secreted protein was detected in the 
conditioned medium and conferred proliferative bioactiviiy on Epo-dq>endent 

20 cell lines (see Figure 16). 

A dnular long crossover oonnecdon (23 amino acids) is found between 
helix C and heln D. In contrast to v/bst we previously observed for loop AB, 
a iaige deletion of lune residues at poation 111-119 or a seven amino acid 
insertion of a myc qiitope after residue 116 did not affect die secretion of 

25 diese mutdns (Figure 12). Funhermore, tiiese two protdns had normal 
specific activi^, as seen by die ratio of bioassay to RIA. Our rabbit 
polyclonal antibody raised against die native form of die human wild tjfpt fully 
recognized the two mutants, demonstrating diat the overall spatial 
conformation of Epo was well preserved. According to die algoridim of 

30 Emini, EA., et al., I WoL 55:836-839 (1985), die residues 111-119 are 
predicted to be at die surface of die molecule. Primary amino acid alignments 



wo 94/24160 



PCT/US94/04361 



-58- 

of mammalian Epo showed a large variation in the sequence of residues 116 
to 130. including amino acid deletion, insertion, and substitution (See 
Example I). Surprisingly, when the deletion A122-126 mutein, which 
removed the O-glycosylation site (Ser 126). was tiansiendy expressed in 

5 monkey cells, protein secretion was inhibited. Both rodents, rat and mouse, 
lack the O-glycosylation site because of a Scrl26 to Pro replacement. 
Furthermore, when a Serl26 replacement mutein was expressed in normal • 
CHO cells. (DeLormc. E., et aL. Biochemistry 557:9871-9876 (1992)) or 
when wild type Epo was expressed in cells having a defect in 0-linkcd 

10 glycosylation (Wasley. L.C.. etal.. Blood 77:2624-2632 (1991)). neither 
secretion nor biological activity were impaired. Therefore, failure of secretion 
of die A122-126 mutein may be die result of some other structural alteration. 
In particular, die proline residue at position 122 is invariant among mammals. 
NandC Termini Deletion of residues 2 to 6 only slightiy affected 

15 die procesang of a biologically active protein (see Figure 16). This deletion 
may impair cleavage of the propeptide, tiierefore explaining the lower yield 
of secreted Epo mutein in comparison to diat of wild type. The fact titat die 
mature monkey protein has an elongated (Val-Pro-Gly) N-terminus strongly 
suggesB that die N-terminal part is not involved in die bioactivity of the 

20 mrteoilc. Further evidence comes from the results, iqwrted below, on die 
N<iK>ly*His-^ fusion protein expressed in £. coff, and also from the identical 
binding of /b vitro translated "S-Iabeled wild type Epo onto EREX-glutafluone 
' agarose beads (Figure 13), with or widiout addition of canine pancreatic 
microscMnal membranes which peroUt deavage of the propeptide.' 

25 The C-tenninalsequaice following helix Dean clearly be divided into 

two distinct domains, separated by C:ysl61. The residues 151 to 161 were of 
^al interest because th^ are MgMy conserved among mamnwls (See 
Example I). There are only two substitutions: Lysl64 rq>laced by a Thr in 



30 



^All die mutants described in diis example were subcloned into pSPCi4T 
plasnud. 
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artiodactyls and cat, and AIal60 is replaced by a Val in mouse Epo. Both the 
A1S2-1SS and the A1S6*160 muteins remained in the cytosol of the transfected 
Cos7 (Table IV). One possible explanation is that the residues 152 to 160 
may, in fact, participate in the D helix. We predict that Glyl51 is the break 

5 point of the structure. However, it is possible that this residue causes only a 
bend in the a helical structure and helix D may extend to Glyl58. 

The C-ierminal part of the protein (residues 162 to 166) is clearly not- 
involved in any structural or functional feature. Thus, the deletion of the four 
last amino acids or the replacement of residues 162-166 by either a KDEL 

10 sequence or a poly-histidine sequence^ did not modify the specific activity of 
the erythropoietin (Figure 14). Radioimmunoassay revealed that the secretion 
of the KDEL-tail mutein in the niedia of transfected cells was 45% less than 
normally obt^ned with the wild type Epo. However, when compared to the 
wild type, this mutein had more biological activity in the hypotonic Cos? cell 

15 extracts. The KDEL COOH-terminal sequence has been shown to be essential 
for the retention of several proteins in the lumen of die endoplasmic reticulum 
(Andres, D.A,. era/.. L BioL Oim. 266:14288-14282 (1991)). 
. Nevertheless, because of overproduction in transiently expressed cells, a laige 
percentage of recombinant protein escaped into die media. 

20 DisuVlde Bridges. Waqg etd. demonstrated diat die biological 

acdyity of ^ was lost irreverably if die sulfhydryl groq>s were alkylated 
(Wang. F.F.. etd., Endocrinology ii6:2286-2292 (198S)). The mature 
human ^ has two internal disulfide bonds: Cys7s=Cysl61, linkiqg die NH, 
and CpOH termiid of die protein and a small bridge between Cy^9 and 

25 Cys33. Cys33 was previously changed to Pro by site-directed miitagene^^ 
and die resulting protein was r^ned to have gready reduced in }iiro 
biological activity (Lin, F.-K., Molecular and CelUdar Aspects cf 



^e poly-His tail wild type mutant was purified by means of nickel 
affiiuty chromatography which will enable quantitation of cytosolic-retained 
30 mutants. The (His)6 CQOH-terminal sequence has been appended to all die 
mutdns described in dus example. 
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Erythropoietin and Erythropoiesis H8:23-36, NATO ASl Series. Springer- 
Verlag. Berlin Heidelberg (1987)). However, rat and mouse Epos have the 
same substitution and yet exhibit full cross-species bioactivity. To resolve the 
role of the small disulfide bridge in human Epo function, we created a 

5 C29Y/C33Y double mutation. The resulting rautcin was normally processed 
and showed the same in vitro bioactivity as the wild type (Figure 11 A).' 
Furthermore, the deletion of five amino acid residues (A32-36) did not impair- 
the secretion of a biologically active mutein. These data demonstrated that only 
the native and fully conserved disulfide bridge Cys7 =Cysl 61 is crucial for the 

10 preservation of the molecular strucnire of erythropoietin. 

Functional Role of the Glycosylation. Natural or recombinant human 
Epo is a heavily glycosylated protein; 40% of its molecular weight is sugars 
(Davis. J.M., etal.. Biochemistry 26:2633-2638 (1987)). The protein has 
three N-linked oligosaccharide chains, located at amino acid positions 24 

15 and 38 (in predicted loop AB) and position 83 (in loop BC). It has one 0- 
linked caibohydiate chain at position 126 (in loop CD), which is missing in 
rodents. The role of Aese sugar chains in the biological activity of the human 
hormone has been extensively studied. Site-directed mutagenesis at the N- 
. glycosyladon ales demonstrated that even though Ae sugars were important 

20 fiJT piopcr biosynthesis and secretion, their removal did not affect in vitro 
activity. This finding was corroboiated by several investigators (Yamaguchi, 
K.. et aL, J. Bid. Om. 266:20434-20439 (1991); Dub«. S., et d., J. Biol. 
Chm. 263:17516-17526 (1988)). However. Takeuchi. M.. et d., J. Bid. 
am. 265:12127-12130 (199Q) found fliat N-glycanase digestion results in 

25 almost complete loss of biological aocmty. In contrast, ihwe is general 
consensus tiiat glycosylation plays a key role in die biological activity of the 
hormone in vivo. Various npons have demonstrated dot die N-Iinked sugar 
diains enhance die stidiility and sunaval of Bpo in die blood stream (Fukuda, 



'Two angle rq)lacemem muteins (C29Y and C33Y) were also stable and 
30 had full biological activity (results not shown). 
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M.N„ etcL, Blood 73:84-89 (1989); Tsuda, E., etal., Eur. J. Biochem. 
iS8:405-411 (1990)) and protect the hormone against clearance by the liver 
(Sasaki, H., et aL, J. Biol. Chem, 262:12059-12076 (1987)). thereby enabling 
the transit of the hormone from its site of production in the kidney to its target 

5 cells in the bone marrow (Spivak. J.L., et cL, Blood 73:90-99 (1989)). 

We expressed the wild type Epo in £. coli. Accordingly, the produced 
protein completely lacks sugar. The pET expression system was used and is 
detailed in the Methods section. IPTG induction of transformed 8L21 (DE3) 
bacterial strain rapidly results in a high level of expression of the poly-His Epo 

10 fusion protein (Figure 15A and 15B). After three hours of induction, we 
obtained a typical yield of -1 mg of total protein per ml of culture. 
However, the vast majority of the expressed protein was present in the 
inclusion bodies and therefore its solubilization and purification on the nickel 
beads was performed in 6 M guanidine-HCl. Oxidative refolding and factor 

15 Xa cleavage resulted in soluble forms (Figure 15C) and the in vitro biological 
acti>nty was tested on HDC57 cells. The cleaved £. coU recombinant ^o 
showed a notable decrease of the specific activity (10% less than the fully 
. glycosylated mammalian expressed protein), but was still able to maintain 
HCDS7 proliferation. The observed reduction of in vitro activity is likely to 

20 be due to improper refolding of the insoluble protein and impaired physical 
stability of the E. coU Epo as previously rqwrted (Naifai, et al., /. BioL 
Chem. 2d6:23022-23026 (1991)). However, the fact that the cofi Epo was 
still able to trigger HCD57 growth indicated an overall preservation of die 
molecular stnicture. The uncleaved fuaon prcKdn, vddi 10 His readues at die 

25 N-terminus, exhibited a 67% loss in biological activity vAnsn coiiq>ared to die 
cleaved ptotdn. Thus, die addition of a 20-residue sequence to die N- 
terminus partially Inhibited die biological activity. 
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Discussion 

Currently the accrual rate of new protein sequences through gene 
cloning far outstrips the rate of determination of three dimensional structure. 
Epo is among a large number of biologically important proteins which have 
5 not yet been analyzed by X-ray diffraction or NMR. The problem is simplified 
by cumulative evidence that the structures of most proteins are likely to be* 
variations on existing themes (Levitt, M., et ai, Afamrf 267:552-558 (1976)). 
Indeed, as mentioned above. Epo appears to share common structural features 
with a large group of cytokines (Parry, D.A.D., et aL, J. MoL 

10 Recognition 7:107-110 (1988); Bazan, R, Immunology Today 77:350-354 
(1990); Manavalan, P., era/., J. Protein Qiem. 77:321-331 (1992)). 

Computer-based prediction of structure can be reduced to a three stage 
process: secondary structure is predicted from the primary amino acid 
sequence and, when available, optical measurements. Analysis of ^ by 

15 circular dichioisra reveals about 50% a helix and no detectable /3 sheet (Lai, 
P.-H.. etd., J. Biol Chem. 267:3116-3121 (1986); Davis, J.M., etal., 
. Bfoctenw//y 26:2633-2638 (1987)). Wth the kn^ 
secondary structural elements are then packed into a set of alternative tertiary 
struttuies. The munber of plaimble arrangements can be reduced by 

20 empirical knowledge of prefened helix-helix packii^ geometries and Ae need 
for globular structure to form a hydrophobic core. The putadve textiaiy 
structure is then refined by standard force field calculations. Since there are 
a huge number of alternate tertiary structures, the availability of 
experimoually determined structure of a homologous protein is cridcally 

25 important. Thus the predicted model of Epo structure gains considerable 
validity by knowledge of die strucnires of GH (Abdel-Meguid, S.S., er al., 
hvc. Nail. Acad. Sd. USA W:6434^37 (1987); deVos, A.M., etal.. 
Science 255:306-312 (1992)) and lU (Redfield, C, etal.. 
Biochemistry 30:11029-11045 (1991): Powers, R., et aL, 

30 Science 256:1673-1677 (1992)). 



wo 94/24160 



PCTAJS94/04361 



.63- 

We have tested the predicted four anti-parallel a-helical bundle 
structure by means of site-directed mutagenesis. Deletions within predicted 
a helices would be expected to destabilize tertiary structure, whereas deletions 
or insertions in non-helical segments should be permitted unless they impose 

5 undue strain on the structure. For example, a deletion in an overhand inter- 
helical loop may result in insufficient length to connect the two helices. 
Results that we have obtained on muteins produced in mammalian (Cos?) cells 
are summarized in Figure 16. Our measurements of the quantities of 
processed mutein by RI A may underestimate the true amount of secreted ^o. 

10 Even a small deletion or insertion can result in a conformational change that 
may lead to impaired binding by our polyclonal antibody, raised against native 
human Epo. Thus the values for specific activity (biologic activity/RIA) that 
we tepon must be regarded as approximations. This caveat notwidisianding, 
our mutagenesis results are in good agreement witii our proposed four 

15 o-helical model of crytiinvoietin. The proper folding of ¥fo into its native 
tertiary structure is necessary for stability and biological function. Mutdns 
witii short deletions inridc predicted a-helices were not processed and cxMbited 
no biological activity. In contrast, when deletions were created in predicted 
inteiconnecting loops, secreted proteins were detected, to varying degrees, 

20 botii by radioimmunoassay and bioassay. Furtiiennore, additions or deletions 
at die N or C-termim did nm markedly impair die secretion and the biological 
activity of tiie Epo protein. Moreover, mutation? at Cyi29 and Cys33 showed 
Uiat die small disulfide loop is not critical for biological activity. In order to 
delineate Epo's functionally important residues involved in the direct binding 

25 onto die Epo recq)tor, we have prepared and tested a series of amino acid 
rq)lacements on tiie surfaces of tiie predicted a helices. These experiments 
are described in a Example III. 
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EXAMPLE III 

Erythropoietin Structure-Function Relationships: 
Muteins That Test Functionally Important Domains 

Introduction 

5 The Epo receptor is a member of an ever enlarging family of cytokine • 

receptors (Bazan. F.. Biochm. Bhphys. Res, Commun. 764:788-796(1989)). 
In like manner, Epo shares weak sequence homology with other members of 
a family of cytokines which also include growth hormone, prolactin, IL.2, 
lL-3, lL-4. IL-5. IL-6. IL-7. G-CSF, GM-CSF, M-CSF, oncostatin M, 

10 leukemia inhibitory factor and ciliary neurottophic factor (Parry. D.A.D. 
etal., J. Mol. Recognition i:107-110 (1988); Bazan F., Immunology 
Today i7:350-354 (1990); Manavalan, P. et al.,J. Protein Own. 77:321-331 
(1992)). The genes encoding these proteins have similar numbers of exons as 
well as a clear relationship between intron-exon boundaries and predicted a- 

15 helical bundles. Such theoretical models of the structures of human growth 
, hormone (Cohen, F.E. etal.. Proteins: Strua. Func. Genet. 2:162-166 
(1987)) and IL4 (Curtis. B.M. etal.. Proteins: Struct. Func. Genet. 
ii:lll-119 (1991)) have been in remarkably good agreement with subsequent 
structures established by X-ray diffraction (HGH) (Abdel-Meguid, S.S. et al., 

20 Pmc. Natl. Acad. Sd. USA W:6434-6437 (1987); de Vos. A,M. et al., 
Sdence 255:306-312 (1992)) or by multidimensional NMR (IL4) (RedficM. 
C. etal., KtxhemistTy 5(^11029-11035 (1991); Pow«s. R. etal., 
Sdence 256:1673-1677 (1992)). Moreover, the ciystal structures of GM^SF 
(Diederichs, K. et al., Sdow* 254: 1779-1782 (1991)) and monomcric M-CSF 

25 (Pandit, J. etal.^ Sdence 258:1358-1362 (1992)) are also in reasonable 
agreanem their predicted strucmres. 

The structure of ^o has not yet been analyzed by either X-ray 
diffraction or by NMR. In order to begin to gain an understanding of 
structure-function relationships, we have taken a diree-pronged ^preach: 



wo 94/24160 



PCTAJS94/04361 



-65. 



a) Sequence determination of Epo from mamnials of different 
orders in order to establish regions of homology. This work is 
described in Example I. 

b) Construction of a model of the three-dimensional structure of 
Epo, followed by the design and preparation of muteins that test 
this model. These experiments are presented in Example IL 

c) Design and testing of muteins that provide information on 
receptor binding domain(s). This work is presented in this 
Example. 



10 MATERIALS AND METHODS 

Construction of the Mutagenic/Mammalian Expression Plasmid. A 
M13 plasmid, containing a 1.4 Kb £coRI-£coRI human Epo cDNA insert 
(XHEpo FL12) was a gift from Ctenetics Institute (Cambridge, MA) (Jacobs. 
K.etal., Nature 375:806-810 (1985)). A 943 bp EcdSl-Bglll fragment, 

15 corresponding to the complete coding sequence of the wild-type human 
erythropoietin, including untranslated regions 216 bp upstream and 183 bp 
downstream, was inserted into the mammalian expression plasmid pSGS 
(Stratagene ) (Sambrook, J. era/.. In "Molecular Cloning: A Laboratoiy 
manual', 2nd ed. Cold Spring Harbor. Laboratoiy, Cold Spring Haibor, 

20 N.Y., vol. 1, pp.L3M.41 (1989)) and named pSG5-Epo/WT, 

SiU-IHrtcied MiOagenesis was carried out according to the protocol 
described by Kunkel, T.A. etd., Meth. Entymol ISi'JiSJ-^Zl (1987). 
Single-scianded DNA was rescued from the pSGS-Epo^ phagemtd grown 
overnight in£. coir CJ236, in 2XYT media oomainiqg M13K07 helper phage 

25 Gn Vitrogen) and 70 Mg/ml kanamacyn (Sigma, St Louis^ Mo). The resulting 
uradl-oontauning ssDNA was used as a template for mutagene^s. 
Oligonucleoddes (24 to 46 mer) were synthesized with their 5' and V ends 
complementary to the target wild-type Epo sequence. A large varies of 
mutations (base substitudons, deletions and insertions) were created at the 
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centcrs of the mutagenic primer sequences. Annealing of the phosphorylated 
primers (10:1 oligonucleotidc/DNA template molecular ratio) was performed 
in 10 jil of a 20 mM Tris-HCL pH 7.4. 2 mM MgClj, 50 mM NaCl solution. 
The reactions were incubated at 80°C for 5 minutes, then allowed to cool 

5 slowly to room temperature over a one hour period. The DNA polymerization 
was initiated by the addition as a mix of 1 /il of lOX synthesis buffer (100 mM 
Tris-HCl pH 7.4 50. mM MgCl,, 10 mM ATP, 5 mM each dNTPs. 20 mM 
DTT). 0.5 (8 units) of T4 DNA ligase and I ft) (1 unit) of T4 DNA 
polymerase (Boehringer Mannheim). After 2 hours at 3rC, 80 /il of IX TE 

10 was added. 5 ^1 of the diluted reaction mix was used to transform competent 
E. cott NM522 (ung*. dut*). 

Since a 40-80% muution yield is normally obtained, 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced widi 7-deaza- 
dGTP and Sequenase (U.S. Biochemical) (Sanger. F. et al. , ?roc. Nail. Acad. 

15 Sd. USA 74:5463-5467 (1977)). As a rule, the entire coding sequences of die 
Epo mutants were examined for the presence of unwanted mutation by 
sequendng or restriction enzyme mapping. 

Pmdttcdon of Wild-Type and Epo Mutdns in MammaUan Cells. 
Cos7 cells grown to i^rox. 70% confluen(7 were transfected wiUi 10 Mg 

20 recombinant plasmid DNA per 10 cm dish using die calcium phosphate 
precipitation protocol (Kingston. R.E. etal.. In "CSment Protocols in 
Molecular Biology". Green Publ. Assoc. & Wiley Intersdcnce, N.Y., vol. 1, 
H». 9.1.1-9.1.9 (1991))- As a control of tnmsfecdon effidency, in several 
experiments 2 ng of pCHllO plasmid (Pliarmacia) was co-transfected and ^- 

25 gaiamdi'a^ activi^ measured in die cytoplasnuc attracts. 

RNA Blot-Hybridaetbn Analysis. Total RNAs were prqKired ftom 
cultured Cos7 celU (Qiigwin. J.M. et al.. Biodtomsr/y 18:5294-5299 (1979)) 
and 2 /ig samples electrophoresed on a 1.1% agarose gel oontiumng 2.2 M 
formaldehyde. Transfer to (jeneScrecn Plus filters (New England Nuclear) 

30 and hybridization witii a «P-labeled wt Epo probe were carried out as 
previously described (Faquin, W.C. a al. , Blood 79:1987-1994 (1992)). 
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QuantUation of Transiently-Expressed Recombinant Epos. The 
amount of secreted protein in the supematants of transfected Cos7 was 
determined by a radioimmune assay (RIA). The RIA was performed using a 
high titer rabbit polyclonal antiserum raised against the human wild-type Epo 
5 and produced in our laboratory. '"I-labeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C. et al., Exp. Hematol. 21: 420426 (1993)). 

Bioassays. The dose-dependent proliferation activities of wt and ^ 
muteins were assayed in vitro using three different target cells: murine spleen 
10 cells, following a modification of the method of Krystal (Krystal, G.,Exp. 
Hematol. JJ:649-660 (1983); Goldberg. M.A. et al.. Proc. Natl. Acad. Sci. 
USA 8^:7972-7976 (1987)); murine Epo responsive murine cell line, 
developed by Hankins (Hankins. W.D. etal.. Blood 7ft 173a (1987)); and 
human EpoKlcpendent UT-7/Epo cell line, derived from the bone marrow of 
15 a patient with acute megakaiyoblastic leukemia (Komatsu. N et al.. Cancer 
Res. 57:341-348 (1991)). After 22 w 72 hours of incubation widi increaang 
amounts of recombinant proteins, cellular growtii was determined by pH]- 
tijymidine (New England Nuclear) uptake or by tiie colorimetric MTT assay 

(Sigma) (Yoshimura. A. etal., Proc. natl. Acad. Sd. USA «7i4139-4143 

20 (1990)): 

RESULTS AND DISCUSSION 

Our objective is to identify tiie ftjnctionally important domains on ti» 
Epo molecule by preparing muteins tiai have altered specific activity, i.e. 
significanUy high or low biological activity per unit mass of protein. Such 
25 muteins must satisfy tiie following criteria: efficient secretion by Cos? cells; 
full recognition (near normal binding affinity) by tiie polyclonal anti-Epo 
antiserum tiiat we use in our ladio-immune assay; preservation of tiie overall 
folding and diree dimensional structure. In order to meet tiiese criteria, tiie 
muteins diat we employ in tiiis study have only single amino acid 
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replacements. A comprehensive amino acid replacement scan of Epo would 
of necessity require the preparation and testing of a minimum of 166 muteins. 
Such an undertaking would be prohibitively labor intensive and costly. 

In order to devise a rational strategy to reduce the number of required 
5 replacements, we have taken advantage of information we have obtained 
employing scanning deletion muteins (See Example II). On the basis of its 
primary amino acid sequence and disulfide bonds Epo. in common with other • 
members of the cytokine family is predicted to have a four antiparallel 
amphipathic o-hclical bundle structure. We have shown that deletions in non- 
10 helical legions at the N-tcrminus. the C-terminus and in loops between helices 
result in ti»e formation of protein that is readily secreted from the cell and is 
biologically active. Tliese regions can be ruled out as functionally important 
domains such as the sites involved in the binding of Epo to its receptor. These 
results suggest that the functionally important contact sites are likely to reside 
15 on the predicted a helices. Furthermore, since ihc amphipathic helices are 
predicted to bind to one another along their hydrophobic surfiices. the 
biologically rdevant contact sites are likely to be residues predicted to be on 
' tiK external surfaces of the hdices. These considerations greatly icstrirt 

mimber of mutations needed to provide a comprehensive and informative body 
20 ofdata. 

• Because strucaiie-fiinction studies on growth hormone and IL4 have 
indicated that Helices A and D are involved in lecepior binding, most of our 
mutdns have been directed to these regitms. 

Results on HdU A muteins are shown in Figure 17. The top panel 

25 shows a helical wheel projection. Replacemem by alanine of the predicted 
external residues 13. 18. 20 and 21 have no significart impact on biological 
activity as determined by the relative incorporation of pH]-thymidine into Epo 
dependent HCD57 cells. However, the replacement of ArgM by Ala showed 
maricedly impaired biological activity. 

30 Figure 18, top panel, shows a helical wheel projection for Helix D. 

Nine of the nine predicted external residues were replaced by alanine. 
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Mutations at positions 136, 139 and 140 gave rise to proteins that were 
efficiently secreted and had full biological activity (Figure 18, bottom). In 
contrast Ala replacement of either Lysl40. Argl43. Serl46, Asnl47 or 
Lysl54 resulted in muteins with significantiy (3-fold) increased biological 
5 activity. Replacement of Asnl47 with Ala resulted in the greatest increase in 
biological activity. (Figures 23-25). 

Because the C-terminal boundary of Helix D is uncertain, we tested 
alanine replacements at positions 152. 154 and 156 which are predicted to be 
indieadjacentnon-helicalsegmentatthcC-termlnus. As shown in Figure 19. 
10 the Ala replacement at Lysl54 had slightly increased biological activity while 
the Ala replacement atTyrl56 had slightly decreased biological activity. 

The experiments reported above all involve testing the function of 
human Epo muteins on the mouse Epo responsive HCD57 cell line. Figure 20 
shows a comparative display of those muteins that appear to have Mgbfit 
15 biological activity than wild-type 1^. 

In order to assess the significance of these results we have also tested 
these muteins in a more physiologic system, mouse erythroid progenitors 
' prepared from the spleens of animals challenged with phenylhydrazine. This 
bioassay obviates the possible confounding effects of a malignant clonal cell 
20 line such as HCD57 which was derived from mouse erythroleukemia vim. 

. As shown in Figure 21, results with eiydiroid cells from mouse spleen 

were very similar to tiiose obtaned wiA HCD57 cells. 

In addition we have tested these muteins in an Epo responsive human 

cell line UT-7/Epo, derived from a patiem with acute leukemia. This cdl line 
25 provides the opportunity of examining the interaction of human muteins with 

the human Epo receptor. As shown of ordinary in Figure 22. these results 

closely parallel those witii tiie two mouse cell bioassays. 

It should be noted that in several of die dose response curves, at tiie 

highest concentrations of Epo tested, there is a significant drop-off in the 
30 incorporation of pH>thymidine. This is due to the fact that high levels of 

biolo^fy active Epo causes terminal maturation and cessation of ^vision. 
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Bccausc of this possibly confounding phenomenon, it is important to have, as 
in Ae experiments that we report here, a full dose response curve for each 
mutcin with an adequate number of data points. 

Taken together our results show that a restricted number of residues on 
5 Helix A and on Helix D are involved in Epo's biological activity. Further 
experiments are in progress to determine if the observed alterations in 
biological activity can be explained by parallel changes in the binding affinity 
u, the Epo receptor. In addition, at the sites that we have identified as 
involved in Epo's biological activity, we plan to make a set of addiuonal 
10 replacement muteins in onler to further refine the ligand-receptor interacuon. 
We are particularly interested in developing muteins with maximally increased 
biological activity, e.g. by preparing compound or multiplex replacements, 
each of which confers optimal enhancement of biological activity. Such a 
•superEpo" could have improved utility In the therapy of anemias. 



15 



Concludons 



We have produced a number of erythropoietin muteins comprising 
«ttivc erythropoietin witii one or more modifications which improves the 

biological activity. 

• All references mentioned herein are incorporated by lefenau* into this 

20 disclosure. 

Having now fully described the invention by way of illustiation and 
example for purposes of clarity and understanding, itwillbeappaienttothose 
of ordinary skill in tiie art tiuii certain changes and modifications may be 
practiced witiiin the scope of tiie invention, as limited only by the following 
25 dums. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Brigham and Women's Hoepital 
75 Francis Street 
Boston. MA 02U5 

INVENTORS: Boissel. Jean- Paul R. 
Bunn, H. Franklin 
Wen. Danyi 
Showers, Mark 0. 

(ii) TITLE OF INVENTION: Erythropoietin Muteins With Enhanced 
Activity 

(iii) NUMBER OF SEQUENCES: 59 

(iv) CORRESPONDENCE ADDRESS: , , „ 

(A) ADDRESSEE: Sterne, Kessler, Goldstein Cc Fox 

(B) STREET: 1100 New York Avenue, Suite 600 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U,S.A. 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

<D) SOFTWARE; Patent In Release #1-0, Version «1.2& 

(vi) CURRENT APPLICATION DATA: . 

(A) APPLICATION NUMBER: (to be assigned) 

(B) FILING DATE: Herei*ith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/049.802 

(B) FILING DATE: 21-APR-1993 
<C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

• (A) NAME: Cinbala. Michele A. 
(B) REGISTRATION NUMBER: 33.851 
<C) REFERENCE/DOCKET NOMBERi 0627.336PC01 

(ix)' TELECCamUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 



(2) INFORMATION FOR SEQ ID NO: I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) 8TRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATWGAATGAA SGC 

(2) INFORMATION FOR SEQ ID N0:2: 



13 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 



10 

AWGGTNSGGG 



(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

24 

TGCTTCTGCT ATCTTTGCTG CTGC 
(2) INFORMATION FOR SEQ ID N0:4: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 12.. 13 , ^ 

(D) OTHER INFORMATION: /labels Peptide 
• - /note- -The amino acid at position 12 nay 

ieoleucine. The amino acid at position X3 may 
also be aspartic acid." 

(xi) SEQUENCE 0ESCRIPT1C»1: SEQ ID H0:4: 

Asp Thr Lys Val Asn Phe Tyr Ala Trp Lye Arg Met Glu Val Gly 

(2) INFORMATION FOR SEQ ID H0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

25 

T6AAGTTTG6 CCGAGAAGTG 6AT6C 

(2) INFORMATION FOR SEQ ID N0:6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY; both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AAGAKGTACC TCTCCAGRAC TCG 
(2) INFORMATION FOR SEQ ID N0:7: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH 1 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(XX ) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTGCTCCACT CCGAACAMTC AC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CTGGAGTGTC CATGGGACAG 
(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DBSGRIPTIONs SEQ ZD R0:9: 
AOGCGCGGkG ATGGGG6T6C 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

TGGTAGCT6G 6GGTGGGGTG TGCACAC6GC AGCAGGATTG AATGAAGGCC AGGGAGGCAG 60 

CACCTGAGTG CTTGCATGGT TGGGGACAGG AAGGACGAGC TGGGGCAGAG ACGTGGGATG 120 

AAGGAAGCTG TCCTTCCACA GGCCACCCTT CTCCCTCCCC GCCTGACTCT CA6CCTGGCT 180 

ATCTGTTCTA GAATGTCCTG CCTGGCTGTG GCTTCTCCTG TCCCTGCTGT CGCTCCCTCT 240 
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GGGCCTCCCA GTCCTGGGCG CCCCACCACG CCTCATCTGT GACAGC 
(2) INFORMATION FOR SEQ ID NO; 11: 

(i) SEOireWCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGTAGCTGT TGGGGTX3GGG TGCGCACGAA GCCGCGGGAT TGAATGAACG CGGAGGCAGA 
GCTGAGCGCT CCCAAGGTCC GGGTG6GGAG GGCTCACTGG GGGCTGC6CA GC6GGATGGG 
TGAACCTGCC GGTCCAAAGC A6CCCTTCTC CTGCGAGAAT GCACCTCATC CGCAGCCC6T 
CTCTCTTTCC TAGACT6TAC TCCGCTGCT6 CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT 



€0 
120 
180 
240 
290 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
CTCGTAGTTG TTGGGTGGCG TGTGCACGCG GTCA6TGTAT TGAATGAAGG CAAAGGAGGC 
AGAGCCTGCG CGCTCGCAAG GTTGGGGTCG GAAACGGCTG GCTGGGGGCA AGACGGCCGG 
ATGGGTGAAC CTGTCCGTCT AAACCAACCC TTCTCCTGCA ATGCCCAGCC TCACATTCAG 
CCTGGCTCTC TTTCCTAGAA TGTCCTGCCC GGCTGCTTCT GCTATCTTTG CTGCTGCTTC 
CTCTGGGCCT CCCAGTCCTG GGCGCCCCCC CACGCCTCAT CTGTGACAGC 
{2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTAGCTCGGG GGGTGGGGTG TGCACGCGGC CGCGGGATTG AATGAAGGCA AAGGAGGCAG 
AGCCTGAGCG CTCGCAAGGT TCGGGTCGGG AGGACTCGCG G66CCAGAGC A6CGGGATGG 120 
GTGAACGTGC AGGTCCAAAC CACCCCTTCT CCTGCAAGTC CCGGCCTCAC ACTCA6CCCA 180 
GCTCTCTCTC CTAGAATGTC TTCTGCTGCT TCTGCTGCCC TCCTTGCTGC TGCTTCCTCT 240 
GGGCCTCCCA GTCCTGGGCG CCCCCACACA CCTCATCTGT GACAGC 

(2) INFOfcMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



60 



286 



60 
120 
180 
240 
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CTGGGCCTCC CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGC 288 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

TGGGGGTGGG GTGTGCAGGC GGCAGCAGGA TTGAATGAAG GCA6AGGAGA CAGACCCTGA 60 

GCGCT6GGAA GGTTGGGGGC AGGAGCCACT AGCTGGGGGC AGAGGAGGG6 GATGCGTGAA 120 

CCTGCCCCTC CAAACCAACC CTCCCCTTCC CCTCCCTGCC TCACACTCAG CCCGGATCTC 180 

CTTTCCAGAA TGTCCTGCCC TGCTGCTTCT GCTGTCCCTG CTACTGCCTC CTCTGGGCCT 240 

CCCAGCCCTG GGCGCCCCTC CACGCCTCAT CTGTGACACC 280 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: eingle 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CAGGTCAGTT TGATTGGGTG GATGTGTGCC GCGAAGCAGC GGCCGGATTG AATGAAGGCA €0 

GGGGAGGCAG AACCTGAACG CTGGGAA6GT GGGGGTCGGG CGCGACTAGT TGG6GGCAGA 120 

GGAGCGGGAT GT6TGAACCT GCCCCTCCAA ACCCACACAG TCAGCCTGGC ACTCTTTTCC 180 

AGAATGTCCT 6CCCTGTTCC TTTTGCTGTC TTTGCTGCT6 CTTCCTCT66 ACCTCCCAGT 240 

CCTGG6C6CC CCCCCTCGCC TCATTTGTGA CA6C 274 
(2) . INFORMATION FOR SBQ ID HO: 16: 

(i) 6EQDEHCB CKARACTBRISTICS : 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY t both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

TGGTAGTTTG GCGGGTGGTG TGTGCTCACC GCGGCGGCGG CGGGGATTGA ATGAAGGCCA 60 

A6GAGGCAGA ACCCGAGCGC TGGGAAG6TT CGGGGTGGGA GCGACAAGCT GGGGGCAGA6 120 

GAACGGGATG T6TGAACCTG CTCCTCCAAA CACACTCAGC CTGGCACTCT TTTCTAGAAT 180 

GTCCTGCCCT GCTGCTTCTG CTATCTTTGC TCKTTGCTTCC CCT(3G6CCrC CCA6TCCTG6 240 

G06CCCCCCC T06CCTCATC TGTGACAGC 269 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 baec pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGTAGTTTGG CGGGTGGGGT GTGCTCACCG CGGCGGCGGG GGGGATTGAA TGAAGGCCAA 
GGAGGCAGAA CCCGAGCGCT GGGAAGGTTC GGGGTGGGAG CGACAAGCT6 GGGGCAGAGG 
AACGGGATGT GTGAACCTGC TGCTCCAAAC ACACTCAGCC TGGCACTCTT TTCTAGAATG 
TCCTGCCCTG CTGCTTCTGC TATCTTTGCT GCTGCTTCCC CTG6GCCTCC CAGTCCTG6G 
CGCCCCCCCT CGCCTCATCT GTGACAGC 
(2) INFORMATION FOR SEQ' ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQOENCE DESCRIPTION: SEQ ID N0:18: 
CGGTOGCTTG GGGGGTGOGC TGTGCAGCGC GGAGGGATTG AATGAAGGCC ACTCAGGCAG 
AGCCTAAGC31 ATTGCAA6GT CGGGGTCAGC AGAGACTATA AGGGCAGAGG GGTCTCGCT6 
A6CCAACCGG CCCCTGCAGT CCCACGGCCC CCTCCCCTCT CGGCCTCACA CTCAGCCTGC 
CTTTCTTCCA GAACGTCCCA CCCTGCTGCT TTTACTCTCC TTGCTACTGA TTCCTCTGGG 
CCTCCCAGTC CTCTGTGCTC CCCCACGCCT CATCTGCGAC AGT 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i)' SEQUENCE CHARACTERISTICS t 

(A) I^GTH: 283 baee pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS: single 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:l9t 
ATCK^TGGCTT GGGGGGTGGT GTGTGCAGCG CCGG<3AGATA GAATGAAGGC CACTCAGGCA 
GAGCCTAAGC AGTTGCAA6G TC0GGGTCA6 CAGAGACTGG AAGGGCACA6 6A6CCTCGCT 
GTGCAAACC6 GTCTCTGGTC 6TCCCACQGC CCTCCCCTCC CAGCCTCACA CCCA6CCT6C 
CTTTCTTCTA 6AACGTCCAA CCCTGCTGCT TTTGCTGTCT TTGCTCCTGC TTCCCCTGGG 
CCTCCCAGTC CTCTGCGCTC CCCCACGCCT CATCTGCGAC AGC 
(2) INFORMATION FOR SEQ ID N0:20: 
(i) SEQUENCE CHARACTERISTICS: 



60 
120 
180 
240 
268 



60 
120 
180 
240 
283 



60 
120 
180 
240 
283 
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(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
GGGTGGTGGC GATGAATGAA GCAGCAGCTA GGTGGGGGGC GGTGAAGTAC AGCCGTCTCA 60 
GAGTGTCTTC TCTCTGCCCT GGCCTCCCAG CTGGCCCCCC CTCATTGGAC AG 112 
t2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AGGCGCGGAG ATGGGGGTGC ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCCCTGCT 60 

GTCGCTCCCT CTGGGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT GTGACAGCCG 120 

AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG AATATCA06A C6GGCTGTGC 180 

TGAACACTGC AGCTT6AATG AGAATATCAC TGTCCCAGAC ACCAAAGTTA ATTTCTATGC 240 

CTGGAAGAGG AT6GAGGTCG G6CAGCAG6C CGTAGAAGTC TOGCAGGGCC TGGCCCTGCT 300 

GTCGGAAGCT GTCCTGCGGG GCCAGGCCCT GTTGGTCAAC TCTTCCCAGC CGTGGGA6CC 360 

cetGCAGCTG CATGTGGATA AAGCCGTCA6 TG6CCTTCGC AGCCTCACCA CTCTGCTTCG 420 

CGCTCTGGGA GCCCAGAAAG AA6CCATCTC CCCTCCAGAT GCGGCCTCAG CTGCTCCACT 480 

CC6AACAATC ACT6CTGACA CTTTCCGCAA ACTCTTCCGA GTCTACTCCA ATTTCCTCCX5 540 

GGGAAAGCTG AAGCT6TACA CAGGGGAGGC CT6CAGGACA GGGGACAGAT GACCAGGTGT 600 

CTCCACCIG6 GCATATCCAC CACCTCCCTC ACCAACATTG CTTGTGCCAC ACCCTCCCCC 660 

GGCCACTCCT CAACCCCCTC GAGGGGCTCT CAGCTCAGC6 CCAGCCTGTC CCATGGACAC 720 

725 

TCCAG 

(2) INFORMATION FOR SBQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEOOBNCE DESCRIPTION: SEQ ID NO: 22: 

ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCTCTCGT GTCGCTCCCT CTGGGCCTCC 60 

CAGTCCCG66 CGCCCCACCA CGCCTCGTCT GTGACAGCCG AGTCCTGGAG AGGTACCTCT 120 

TGGAGGCCAA GGAGGCCGAG AAT6TCACGA TGGGCTGTTC CGAAAGCTGC A6CTTGAATG 180 
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AGAATATCAC CGTCCCAGAC ACCAAAGTTA ACTTCTATGC CTGGAAGAGG ATAGAGGTCG 240 

GGCAGCAGGC TGTAGAAGTC TGGCAGGGCC TGGCCCTGCT CTCAGAAGCT GTCCTGCGGG 300 

GCCAGGCCGT GTTGGCCAAC TCTTCCCAGC CTTTCGAGCC CCTGCAGCTG CACATGGATA 360 

AAGCCATCAG TGGCCTTCGC AGCATCACCA CTCTGCTTCG GGCGCTGGGA GCCCAGGAAG 420 

CCATCTCCCT CCCAGATGCG GCCTCGGCTG CTCCACTCCG AACOITCACT GCTGACACTT 480 

TCTGCAAACT CTTCCGAGTC TACTCCAATT TCCTCCGGGG AAAGCTGAAG CTGTACACGG 540 

GGGAGGCCTG CAGGAGAGGG GACAGATGAC CAGGTGCGTC CAGCTGGGCA CATCCACCAA 600 

CTCCCTCACC AACACTGCCT GTGCCACACC CTCCCTCACC ACTCCCGAAC CCCATCGAGG 660 

681 

CGCTCTCAGC TAAGCGCCAG C 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:23: 

CCGAACGTCC CACCCTGCTG CTTTTACTCT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 60 

TCCTCTGT6C TCCCCCACGC CTCATCTGCG ACAGTCGAGT TCTGGAGAGC TACATCTTAG 120 

AGGCCAAGGA GGCAGAAAAT GTCACGATGG GTTGTGCAGA AGGTCCCAGA CTGAGTGAAA 180 

ATATTACAGT CCCAGATACC AAAGTCAACT TCTATGCTTG GAAAAGAATG GA6GTG6AAG 240 

AACAGGCCAT AGAAGTTTGG CAAGGCCTGT CCCTGCTCTC AGAAGCCATC CTGCAGGCCC 300 

AGGCCCTGCT AGCCAATTCC TCCCAGCCAC CAGAGACCCT TCAGCTTCAT ATAGACAAAG 360 

CGATCAGTOG TCTACGTAGC CTCACTTCAC TGCTTCGGGT ACTGGGAGCT CAGAAGGAAT 420 

TGATGTC6CC TCCAGATACC ACCCCACCTG CTCCACTCCG AACACTCACA GTGGATACTT 480 

TCTGCAAGCT CTTCCGGGTC TACGCCAACT TCCTCCGGGG GAAACTGAAG CTGTACACGG 540 

6AGAGGTCTG CAGGAGAGGG GACAGGTGAC ATGCTGCTGC CACCGTGGTG GACCGACGAA 600 

CTTGCTCCCC GTCACTCTGT CATGCCAACC CTCCACCACT CCCAACCCTC ATCAAACGGG 660 

TCATTACCTT CTTACCAGT ^'^^ 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPBt nucleic acid 
iC) STRANDEDNESS: aingle 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGAACGTCC CACCCTGCTG CTTTTACTAT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 60 
TCCTCTGCGC TCCCCCACGC CTCATTTGCG ACAGTCGCGT TCTGGAGAGG TACATCTTGG 120 
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AGGCCAAGGA GGCAGAAAAT GTCACAATGG GCTGTGCAGA AGGTCCCAGA CTGAGTGAGA 180 

ATATTACCGT CCCAGATACC AAAGTCAACT TCTACGCTTG GAAAAGAATG AAGGTGGAAG 240 

AACAGGCTGT AGAAGTTTGG CAAGGCCTCT CTCTGCTCTC AGAAGCCATC CTGCAGGCCC 300 

AGGCrCTGCA GGCCAATTCC TCCCAGCCAC CAGAGAGTCT TCAGCTTCAT ATAGACAAAG 360 

CCATCAGTGG GCTACGTAGC CTCACTTCAC TGCTTCGCGT GCTGGGAGCT CAGAACGAAT 420 

TGATGTCGCC TCCAGACGCC ACCCAAGCCG CTCCACTCCG AACACTCACA GCGGATACTT 480 

TCTGCAAGCT CTTCCGGGTC TACTCCAACT TCCTCCGGGG -GAAACTGAAG CTGTACACGG 540 

GGGAGGCCTG CAGGAGAGGG GACAGGTGAC CTGCCACTGC CGTGTACCCG CCAACTCGCT 600 

CACCGTCACT GTGTCACCCC AACCCTCCAC CACTCCCAAC CCTCATCAAA CGGGGTTGTT 660 

TGTTACCTTC TTACCGGC ^''^ 
(2) INFORMATION FOR SEQ ID N0:2S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GCGACTGTAC TCC6CTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT CTGGGCCTCC 60 

CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGCCG AGTCCTGGAG AGGTACATCC 120 

TGGAGGCCAG GGAGGCC6AA AATGCCACGA TGGGCTGTGC AGAAGGCTGC AGCTTCAGTG 180 

AGAATATCAC TGTCCCAGAC ACCAAGGTTA ACTTCTATGC CTGGAAGAGG ATGGAGGTCC 240 

AGCAGCAGGC TCTGGAAGTC TGGCAGGGCC TGGCTCTGCT CTCAGAAGCT ATCTTTCGGG 300 

GCCAGGCCCT ACCGGCCAAC GCATCCCAGC CATGCGAGGC CCTGCGGTTG CACGTGGACA 360 

AAGCTGTCAG CGGCCTCCGC AGTCTCACCT CCCTGCTTCG GGCGCTGGGA GCCCAGAAAG 420 

AAGCCATCCC' CCTTCCAGAT GCAACCCCCT CCGCAGCCCC ACTCCGAATA TTCACTGTTG 480 

ATGCTTTGTC CAAGCTCTTC CGAATCTACT CCAATTTCCT GAGGGGAAAG CTGACGCTGT 540 

ACACAGGGGA GGCCTGCAGG AGAGGGGACA GGTGACCCAG TGCTATCACC CCGOGCACGT 600 

CCATCACCTC GCTCACCATC ACTGCCTACG CCATGCCTTC CACGCCGCCA CTCCCAACCC 660 
CTGTCGACGA CGGGCCACCA GCTCAGT 
(2) INFORMATION FOR SEQ ID N0:26: 

(1) SEQUENCE GKARACTBRISTICS: 

(A) LENGTH: 688 base, pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0j26; 
GAATGTCCTG CCCGGCTGCT TCTGCTATCT TTGCTGCTGC TTCCTCTGGG CCTCCCAGTC 60 
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(xi) SBQDENCE DESCRIPTION: SEQ ID NO:27: 
GAATGTCCTG CCCTGCTGCT TCTGCTATCT TTGCTGCTGC TTCCCCTGGG CCTCCCAGTC 
CTGGGCGCCC CCCCTCGCCT CATCTGTGAC ACCCGAGTCC TGGAGAGGTA CATTCTGGAG 
GCCAGGGAGG CCGAAAATGT CACGATGGGC TGTGCTGAAG GCTGCAGCTT CAGTGAGAAT 
ATCACTGTCC CAGACACOUl GGTCAACTTC TATACCTCGA AGAGGATGCA CGTCGGGCAC 
CAG6CT6TG6 AAGTCTGGCA GGGCCTCGCC CTGCTCTCA6 AAGCCATCCT GCGGGGCCAG 
GCCCTGCTGO CCAACTCCTC CCACCCATCT GAGACCCTGC AGCTGCATGT GGATAAAGCC 
GTCAGCAGC^ TGCGCAGCCT CACCTCCCTG CTTCGG6CAC TGGGJIGCCCA GAAGGAAGCC 
ACCTCCXTO: OVGAGGCaAC CtCTGCTGCT CCACTCCGAA CATTCACIGT CGATACTTTG 
TGCmCTTT TCCGAATCTA CTCCAACTTC CTGCGGGGAA AGCTGACGCT GTACACAGGG 
GAGGCCT6CC GAAGAGGAGA CAGGTGACCA GGTGCTCCTA CCCCGGGCAT GTCCACCACC 
TCACTTACCA CCAATGCCTG TGCCACGCCC TCTGCACCAC CACTCCTGAC CCCTGTCGGG 
GGTGATCAGC TCAGCACCAG C 

(2) INFORMATION FOR SBQ ID NO:28: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE; amino acid 
(0) TOPOLOGY: both 



120 
180 
240 



CTGGGCGCCC CCCCACGCCT CATCTGTGAC AGCCGAGTCC TGGAGAGGTA CATCTTGGAG 
GCCAAGGAGG GCGAAAATGC CACGATGGGC TGTGCCGAAA GCTGCAGCTT CAGTGAGAAT 
ATCACTGTCC CAGACACCAA GGTTAACTTC TATGCCTGGA AGAGGATGGA GGTCCAGCAG 
CAGGCCATGG AGGTCTGGCA GGGTCTGGCC CTGCTCTCAG AAGCCATCCT GCAGGGCCAG 300 
GCCCTGTTGG CCAACTCCTC CCAGCCATCT CAGGCCCTGC AGCTGCATGT GGACAAAGCT 360 
GTCAGCGGCC TGCGCAGCCT CACCTCCCTG CTTCGGGCGC TGGGAGCCCA GAAGGAAGCC 420 
ATCCCCCTTC CAGACGCATC CCCCTCCTCT GCCACCCCAC TCCGAACATT TGCTGTTGAT 480 
ACTTTGTCCA AACTTTTCCG CAACTACTCC AATTTCCTGC GGGGAAAGCT GACGCTGTAC 540 
ACTGGAGAGG CCTGCAGGAG AAGGGACAGG TGACTTGGTG CTCCCACCCG GGGCATGTCC 
ACCACTTCGC TCACCACCAC TGCCTGTGCC ATGCCTTCTG CACCTCCACT CCCAACCCCC 
GCTGAGGGGC CATCAGCTCA GCGCCAGC 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: both 



600 
660 
688 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
681 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
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Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gly Val Hie Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
1 5 ^0 

Val Ser Leu Pro Leu Gly Leu Pro Val Pro Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 




Leu lie Pro Leu Gly Leu Pro Val Leu Cya Ala Pro 
20 25 



(2) INFORMATION FOR SEQ ID NO: 31: 

' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: both 



(xi)- SEQUENCE DESCRIPTIONS SEQ ID N0:31: 




Leu Leu Phe Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 30 



(2) INFORMATION FOR SEQ ID N0s32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Glu Cys Pro Ala Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
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15 10 15 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Glu Cya Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Pro Pro Leu 
1 5 10 

Gly Leu Pro Ala Leu Gly Ala Pro Pro 
' 20 25 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 25 amino acide 
(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Glu Cya Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leii Leu Pro Leu 
1 5 10 " 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQOEMCE CHARACTERISTICS: 
(A> LENGTH: 25 amino acids 
(B) TYPE: amino acid 
' (D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:35: 

Glu Cya Pro Ala Leu Phe Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
1 5 1« " 

Aap Leu Pro Val Leu Gly Ala Pro Pro 
^ 20 25 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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Glu Cys Leu Leu Leu Leu Leu Leu Pro Ser Leu Leu Leu Leu Pro Leu 
1 5 10 15 . 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
1 S 10 15 

Leu Glu Ala Lye Glu Ala Glu Asn lie Thr Thr Gly Cye Ala Glu His 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
' 35 40 45 

Tyr Ala Trp Lye Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Val Asn Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu Hie Val Asp 
as 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
^ 100 105 110 

Gly Ala Gin Lys Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 ISO 155 160 

Cys Arg Thr Gly Asp Arg 

165 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
1 5 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 
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Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 «0 

Gin Gly Leu Ala Leu Leu Scr Glu Ala Val Leu Arg Gly Gin Ala Val 
65 70 75 80 

Leu Ala Aan Ser Ser Gin Pro Phe Glu Pro Leu Gin Leu Hie Met Aop 
85 50 95 

Lva Ala He Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala Leu 
^ 100 105 110 

Gly Ala Gin Glu Ala He Ser Leu Pro Asp Ala Ala Ser Ala Ala Pro 
lis 120 125 

Leu Arg Thr He Thr Ala Asp Thr Phe Cya Lys Leu Phe Arg Val Tyr 
130 135 140 

Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
145 150 155 160 

Arg Arg Gly Asp Arg 
165 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Ala Pro Pro Arg Leu Val Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lya Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

Cya Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
' 35 40 45 

Tyr Ala Trp Lye Arg He Glu Val Gly Gin Gin Ala Val Glu Val Trp 
^ 50 55 €0 

Gin iSly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Val 
65 70 75 BO 

Leu Ala Asn Ser Ser Gin Pro Phe Glu Pro Leu Gin Leu His Met Aop 
es 90 95 

Lys Ala He Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala I-eu 
100 105 110 

Gly Ala Gin Glu Ala He Ser Leu Pro Asp Ala Ala Ser Ala Ala Pro 

115 120 125 

Leu Arg Thr He Thr Ala Asp Thr Phe Cys Lys Leu Phe Arg Val Tyr 
130 135 140 

Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
145 150 155 160 

Arg Arg Gly Asp Arg 
165 
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(2) INFORMATION FOR SEQ ID NO:4 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Ala Pro Pro Arg Leu lie Cye Asp Ser Arg Val Leu Glu Arg Tyr He 
1 5 . ■ 10 15 

Leu Glu Ala Lye Glu Ala Glu Aan Val Thr Met Cly Cys Ala Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Aan He Thr Val Pro Aep Thr Lye Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Glu Glu Gin Ala He Glu Val Trp 
50 55 60 

Gin Glv Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 
65 70 75 80 

Leu Ala Aan Ser Ser Gin Pro Pro Glu Thr Leu Gin Leu Hia He Aap 
85 90 95 

Lvs Ala lie Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
^ 100 105 110 

Gly Ala Gin Lys Glu Leu Met Ser Pro Pro Asp Thr Thr Pro Pro Ala 
115 120 125 

Pro Leu Arg Thr Leu Thr Val Asp Thr Phe Cys Lys Leu Phe Arg Val 

135 140 



130 



Tyr Ala Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Val 
145 150 155 160 

Cys Arg Arg Gly Asp Arg 

(2) INFO»ATX0M FOR SEQ ID MO:4l2 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 
* (B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lye Arg Met Lys Val Glu Glu Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 
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is 70 75 ao 

Gin Ala Asn Ser Ser Gin Pro Pro Glu Ser Leu Gin Leu Hie lie Asp 
85 

Ly. Ala He Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
100 "0 

Gly Ala Gin Lys Glu Leu Met Ser Pro Pro Aep Ala Thr Gin Ala Ala 

115 120 

Pro Leu Arg Thr Leu Thr Ala Asp Thr Phe Cya Lya Leu Phe Arg Val 

13T) 135 "0 

Tyr ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 ISO 

Cys Arg Arg Gly Aep Arg 
IfiS 

(2) IMFORMATION FOR SEQ ID 110:42: 

(i) SEQUENCE CHARACTERISTICS : _ 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ala Pro Pro Arg Ueu He Cye Aap Ser Arg Val Leu Glu Arg Tyr lie 
1 S 10 " 

Leu Glu Ala Arg Glu Ala Glu Asn Ala Thr Met Gly Cye Ala Glu Gly 
20 25 30 

eye ser Phe Ser Glu Asn He Thr Val Pro Aep Thr Lye Val Aan Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gin Gin Gin Ala Leu Glu Val Trp 
50 55 ^0 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Phe Arg Gly Gin Ala Leu 
65 70 75 80 

Pro Ala Asn Ala Ser Gin Pro Cys Glu Ala Leu Ars Leu His Val Asp 
85 90 95 

Lye Ala Val Ser Gly Leu Arg Ser Leu Thr Ber Leu Leu Arg Ala Leu 
' 100 105 "0 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Asp Ala Thr Pro Ser Ala 
lis 120 125 

Ala Pro Leu Arg He Phe Thr Val Asp Ala Leu Ser Lys Leu Phe Arg 
130 135 140 

He Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly Glu 
145 - 150 155 160 

Ala Cys Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr lie 
15 10 15 

Leu Glu Ala Lys Glu Gly Glu Asn Ala Thr Met Gly Cys Ala Glu Ser 
20 25 30 

eye Ser Phe Ser Glu Aan He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gin Gin Gin Ala Met Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Gin Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Ala Leu Gin Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Asp Ala Ser Pro Ser Ser 

115 120 125 

Ala Thr Pro Leu Arg Thr Phe Ala Val Asp Thr Leu Cys Lys Leu Phe 
130 135 140 

Arg Asn Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly 
145 150 155 160 

Glu Ala Cys Arg Arg Arg Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY t both 



(xD SEQOBNCE DESCRIPTION: SEQ ID 110:44: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Arg Glu Ala Glu Asn Val Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Cys Ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Thr Trp Lys Arg Met Asp Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Thr Leu Gin Leu His Val Asp 
85 90 95 
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Lya Ala Val Ser Ser Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 105 11" 

Gly Ala Gin Lys Glu Ala Thr Ser Leu Pro Glu Ala Thr Ser Ala Ala 
115 120 

Pro Leu Arg Thr Phe Thr Val Asp Thr Leu Cya Lys Leu Phe Arg He 
130 135 140 

Tyr ser Asn Phe Leu Arg Gly Lya Leu Thr Leu Tyr Thr Gly Glu Ala 
145 150 1S5 "0 

Cya Arg Arg Gly Asp Arg 
165 

{2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
1 S 10 15 

Leu Glu Ala Lys Glu Ala Glu Aen He Thr Thr Gly Cya Ala Glu Hie 
20 25 30 

Cys Ser Leu Asn Glu Aan He Thr Val Pro Asp Thr Lya Val Aen Phe 
^ 35 40 45 

Tvr Ala Trp Lye Arg Met Glu Gly Val Gin Gin Ala Val Glu Val Trp 
50 55 «0 

Gin Gly Leu Ala Leu Leu Ser Glu Aia Val Leu Arg Gly Gin Ala Leu 
€5 70 '5 »w 

Leu Val Asn Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu Hie Val Asp 
85 90 95 

Lye Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 

Gly Ala Gin Lya Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
X15 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lye Leu Phe Arg Val 
X30 . 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lye Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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(xi) SEQUEa^CE DESCRIPTION: SEQ ID NO:46: 

Ser Arg Val Leu Clu Arg Tyr Leu Leu Glu Xaa Xaa Ala Lye 
1 5 10 

(2) INFORMATION FOR SEQ ID N0:47j 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID »0:47: 

Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala 
1 5 10 " 



Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 18 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Pro Leu Gin Leu Hie Val Asp Lya Ala Val Ser Gly Leu Arg Ser Leu 
1 S 10 

Thr Thr 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 amino acids 
• (B) TYPE: amino acid 
(D) TOPOLOCSnr: both 



(Xi) SBQUENCB DESCRIPTION: SEQ ID NOf49: 

Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser Aan 
1 5 10 >5 

Phe Leu Arg Gly Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
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Glu Asn He Thr Thr Gly Cys Ala Glu Hie Cy8 Ser Uu Aen Glu Asn 
1 S 1° 

lie Thr Val Pro Aap Thr Lya Val Asn Phe Tyr Ala Trp Lys Arg Aan 

'in 25 



20 25 

Glu Val Gly Gin 
35 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: SI: 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu Ala lie Ser 

1 S 10 

Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Leu He Ser Glu 
1 5 10 * 

Glu Asp Leu Glu Ala lie Ser Pro Pro Asp Ala Ala Ser Aja Ala Pro 
20 25 J" 

Leu Arg 

(2) IHPORMATioN FOR SEQ ID 110:53: 

(ir SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:S3: 

Cys Arg Thr Gly Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID N0:S4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
Lys Aep Glu Leu 

a 

(2) INFORMATION FOR SEQ ID NO: 55: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: € amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: both 



Ixi) SEQUENCE DESCRIPTION: SEQ ID N0:5S: 

Hie His His His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Gly Hia Ser Ser Gly His He Glu Gly Arg His Met Ala Pro Pro 
i 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Val 'Leu Glu Arg Tyr Leu Leu Glu Ala Lya Glu Ala 
1 S 10 



(2) INFORMATION FOR SEQ ID 110:58: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Arg Thr He Thr Ala Asp Thr Phe Arg Lye Leu Phe Arg Val Tyr Ser 
15 10 15 

Asn Phe Leu Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:S9: 
Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
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Claims: 



1 
1 


1 . An erythropoietin mutein comprising native erythropoietin with 


L 


one or more of the following modifications: 


3 


W 


mniarpmpnt nf the smino acid at Dosition 20 with a first 


4 




ciihctitutfi amino Ecidi 


5 


W 


■vnionpmAnt nf fh^ amino acid at Dosition 49 with a 


£ 

0 




cp>pnn/l ciihctitiite aminD acid* 


7 


(c) 


•A.«lA/«AmaMt j\f tKA omirm ariH at nncitinn 7^ with 
rCpiaCCmcni OI uIC aminu OUIU ol puaiuun / J wiui o 


8 




third substitute amino acid; 


9 


(d) 


replacement of the amino acid at position 140 with a 


10 




fourth substitute amino acid; 


11 


(0 


replacement of the amino acid at position 143 with a 


12 




fifth substitute amino acid; and 


13 


(f) 


rq>lacement of die amino acid at position 146 with a 


14 




sixth substitute amino acid; 


IS 


(g) 


replacement of the amino acid at position 147 wiA a 


16 




seventii substitute amino acid; 


17 


(h) 


replacement of die amino add at position 1S4 widi a 


18 




eighdi substitute amino acid; 


19 


v^^iD said erythropoietin mutein exhibits enhanced biolo^cal activiQr 


20 


conq)ared to said native erythnq>oietin. 



1 2. The erytiiropoietin mutein of claim 1 conq)rising native 

2 erythrqx)ietin with one of said modifications. 

1 3. The erythropoietin mutein of claim 1 wherein said first, second, 

2 tiiird, founh. fifth, sixtii, seventii or eighdi substitute amino acids are selected 

3 from die group consisting of alanine, serine, tiireonine, and glycine. 
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1 4. The erythropoietin mutein of claim 3 wherein said first 

2 substitute amino acid is serine, and/or said second, third, fourth, fifth, sixth. 

3 seventh and/or eighth substitute amino acids are alanine. 

5. The erythropoiedn mutein of clam 2 wherein the amino acid at 
position 20 is replaced with the amino acid alanine. 

1 6. The erythropoietin mutein ofclaim 2 mrtierein the amino acid at 

2 position 49 is rq)laced with the amino acid serine. 

3 7. The erythropoietin mutein of claim 2 wherein the amino acid at 

4 position 73 is replaced with the amino acid glycine. 

5 8. The eiythiq)oietin mutein of claim 2 wherein die amino acid at 

6 poation 140 is replaced widi the amino add alanine. 

7 9. The erythixqwietin mutein of daim 2 wherein die amino add at 

8 position 143 is replaced widi die amino acid alamne. 

9 10. Theetyduopdetinmutdnofclaim2wher«ndieaminoaddat 

10 position 146 is rq)laced widi die anuno add alaiune. 

n. Tijeerydiropoietin mutein of daim 2 i«*ereind» amino add at 
position 147 is replaced widi die anuno add alaiune. 

1 12. The erytiiropoieun mutein of claim 2 wherein die amino acid at 

2 position 154 is replaced widi die amino add alaiune. 

1 13. A DNA molecule encoding die erytiiropoietin mutein of 

2 claim 1. 
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1 14. A recombinant DNA construct comprising the DNA molecule 

2 of claim 13. 

1 15. A vector comprising the DNA construct of claim 14 which is 

2 capable of expressing said erythropoietin mutein in a host cell. 

1 16. A host cell comprising the DNA molecule of claim 13 capable 

2 of expressing the erythn^oietin muiein encoded by said DNA molecule. 

1 17. The host cell of claim 16 wherein said host cell is capable of 

2 glycosylating said erydiropoietin mutein. 

1 18. The host cell of claim 17 wherein said host cell is a mammalian 

2 cell. 

1 19. The host cell of claim 18 wherein said host cell is selected ftom 

2 the group consisting of Chinese Hamster Ovary cells, Cos7 cells,Cosl cells, 

3 baby hamster Iddney cells, and CVl cells. 

1 20. A method of stimulating eryduocyie production in a subject 

2 wherein said mefliod CQn^)rises administering to said subject an efficacious 

3 amount of the erythropoietin mutein of claim 1. 

1 21. A method of stimulating erythrocyte production in a subject, 

2 said method comprising administering to said subject an efficacious amount of 

3 an erythropoietin mutein comprising native erythropoietin with the amino acid 

4 residue at position 143 replaced by an alanine residue, wherein said efficacious 

5 amount is less than the corresponding amount of native erythropoietin. 
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1 22. A method of inducing the proliferation of a cell culture 

2 responsive lo erythropoietin, wherein said method comprises contacting said 

3 cell culture with the erythropoietin mutein of claim 1. 

1 23. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said method comprises contacting said 

3 cell culture with an efficacious amount of an erythropoietin mutein comprising 

4 native erythrq)oietin with the amino acid residue at position 143 replaced by 

5 an alanine residue, wherein said efficacious amount is less than the 

6 corresponding amount of native erydiropoietin. 



1 24. The method of claim 18 or 19 wherein said cell culture 

2 comprises cells selected from the group consisting of murine spleen cells, - 

3 murine erythroleukemia cells, and human UT-7/Epo cells. 

1 25. A method for obtdiung erythropoietin muteins witii enhanced 

2 biological activity relative to native erydiropoietin, s^d mediod compriang the 

3 steps: 

4 (a) prq)aring erydiropoietin muteins comprising native erydirq)oietin 

5 with a angle amino acid substitution, said substitution occurring at a position 

6 selected ftom the group consisting of positions 48-52, IS 1-1G0« and positions 

7 predicted to reside on die external sur&ce of helices A and D; 

8 (b) assayiiig die biological acti^ty of die erydiropoietin muteins from 

9 . step (a) over a range of dosages: and 

10 (c) identifying diose erydiropoietin muteins widi enhanced biological 

1 1 activity relative to native erydiropoietin. 
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